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Abstract 

Solutions  of  the  general  linear  matrix  equation 
n 

Z  A4XB,  ■  C  are  obtained  and  presented  in  this  thesis.  Some 
i-1  1  1 

special  cases  do  arise  like  the  Liapanov  matrix  equation. 

Necessary  conditions  and  sufficient  conditions  are  established 
for  the  solution  of  the  general  linear  matrix  equation.  Other 
forms  of  solutions  than  those  obtained  through  the  use  of 
similarity  transformations  that  have  been  considered  make  use 
of  the  spectral  decomposition  of  matrices  and  tensor  products 
of  matrices  or  K'ronecker  products.  In  considering  the  general 
linear  matrix  equation,  linear  matrix  equations  in  which  two 
different  variables  appear  are  also  studied.  Conditions  for 
the  existence  of  a  solution  for  this  type  of  equation  are 
given.  The  theory  of  the  generalised  inverses  of  a  matrix  was 
used  in  obtaining  a  solution  to  the  general  linear  matrix  equa¬ 
tion.  More  general  forms  of  the  solution  are  given  and  condi¬ 
tions  under  which  these  solutions  exist  have  been  established. 

Solutions  to  systems  of  matrix  equations  were  also  considered. 

As  a  by-product  of  this  investigation,  some  aspects  of  the 
model  reduction  problem  may  be  treated  from  the  point  of  view 
of  matrix  equations.  In  particular,  a  new  method  of  solution 
of  the  matrix  equation  AXB  ♦  CXD  ■  E  which  was  recently  con¬ 
sidered  by  S.K.  Mitra  (Siam  Journal  of  Applied  Math.  321  and 
others  was  obtained.  Applications  of  the  results  of  this 

iv 

A 


work  are  of  use  in  the  estimation  of  variance  and  covariance 
components  of  linear  models  as  treated  by  C.R.  Rao. 
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SOLUTIONS  OF  THE  MATRIX  EQUATION 
AXB  ♦  CXD  -  E 

I .  Introduction 

Matrix  equations  are  becoming  a  more  significant  part 
of  the  formulation,  computation  and  solution  of  problems  in 
the  engineering  and  social  sciences.  The  classic  engineering 
investigations  of  the  Liapanov  equation 

AX  ♦  XB  -  C  (1.1) 

and  the  matrix  Ricatti  equation 

XDX  ♦  AX  ♦  XB  ♦  C  ■  0  (1.2) 

are  probably  the  most  commonly  known.  Solutions  to  these 
equations  and  their  more  general  forms,  given  by  the  equation 

n 

E  A. XB .  •  C  (1.3) 

i-1  1  1 

are  studied  in  this  thesis. 

Solutions  to  Eq  (1.3)  are  found  through  the  application 
of  various  methods  of  solution.  Necessary  and  sufficient  con¬ 
ditions  are  stated  for  the  case  in  which  n  ■  2  in  Eq  (1.3). 
These  conditions  can  easily  be  extended  to  cover  larger  values 
of  n.  The  conditions  given  are  predominantly  based  upon  the 
use  of  similar  matrices.  With  similarity  shown,  the  concept 
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of  pencils  of  matrices  is  used  to  generate  the  actual  solution 
to  the  matrix  equation. 

More  complex  matrix  equations  are  handled  through  the 
introduction  of  the  concept  of  spectral  decomposition.  This 
is  basically  writing  a  matrix  in  terms  of  eigenvalues  and 
idempotent  matrices.  After  some  of  the  theory  of  spectral 
decomposition  is  investigated,  it  is  extended  by  introducing 
nilpotent  matrices.  The  theory  of  solutions  is  developed  for 
square  matrices  first,  and  then  extended  to  include  rectangula 
coefficient  matrices  and  also  rectangular  variable  matrices. 
Following  this  is  a  brief  digression  to  consider  equations  of 
the  form 

AXB  ♦  CYD  -  E  (1.4) 

Equations  of  the  type  in  Eq  (1.4)  are  studied  from  the  point 
of  view  of  being  able  to  be  rewritten  into  an  equation  of  the 
type  in  Eq  (1.5)  with  n  ■  2. 

Two  other  methods  of  solution  of  the  equation  AXB  ♦ 

CXD  ■  E  are  considered.  A  brief  look  is  taken  at  how  general¬ 
ized  inverses  can  be  used  to  solve  equations.  In  doing  so,  it 
becomes  apparent  that  more  than  one  solution  can  exist  for  an 
equation.  The  theory  of  how  these  additional  solutions  are 
generated  is  developed  in  Chapter  III.  The  last  method  of 
solution  considered  is  through  the  use  of  Tensor  Analysis. 

The  second  half  of  this  thesis  develops  the  concepts 
of  generalized  inverses  by  extending  the  methods  that  are 
currently  being  employed  to  allow  a  wider  class  of  solution 


to  be  possible.  This  is  done  by  extending  the  theorems  that 
exist  to  include  more  arbitrary  matrices.  This  then  lets  the 
user  have  some  control  over  the  results  and  more  closely  fit 
the  solution  to  the  problem  at  hand.  Also  in  this  section, 
applications  of  the  theory  are  made  to  the  area  of  model 
reduction. 

The  last  portion  of  the  thesis  recapitulates  some  of 
the  ideas  of  linear  modeling  and  then  applies  the  previous 
work  to  finding  the  variance  and  covariance  matrices.  Another 
application  is  to  the  concept  of  the  reduced  order  filter. 
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II .  Solutions  of  the  Matrix  Equation 
AXB  +  CXD  3  E 

In  this  chapter  the  results  of  Roth  (Ref  38) ,  Rosenblum 
(Ref  36) ,  Mitra  (Ref  28) ,  Jones  (Ref  14) ,  and  Lancaster  (Ref 
21)  are  extended  by  use  of  procedures  taken  from  Rao  and  Mitra 
(Ref  35),  Nering  (Ref  29),  and  Browne  (Ref  4). 

Special  cases  of  the  matrix  equation 

AXB  +  CXD  =  E  (2.1) 

occur  if  B  **  C  *  I  or  A  3  D  =  I,  where  I  denotes  the  identity 
matrix.  Thus,  Eq  (2.1)  reduces  to  the  Liapanov  equation. 

Much  attention  has  been  given  to  solutions  of  the  Liapanov  equa¬ 
tion,  particularly  Gantmacher  (Ref  10),  Ma  (Ref  26),  Rosenblum 
(Ref  36) ,  and  Ziedan  (Ref  46) .  Many  others  have  also  done  work 
in  this  area.  Leuthauser  (Ref  23)  had  studied  Eq  (2.1)  in  the 
cases  where  the  matrices  are  all  taken  to  be  square.  The  re¬ 
sults  presented  in  the  remainder  of  this  chapter  mostly  pertain 
to  rectangular  matrices.  That  is,  in  general  the  solution  matrix 
X  is  an  element  of  a  class  of  matrices  that  are  of  dimension 

m  by  n.  This  will  be  denoted  by  X-  ...  The  dimensions  of  the 

(m,n) 

other  matrices  are:  A,  ,  B,  .  ,  C,  .  ,  D,  ,  and  E,  .  . 

(P»m)  (n,q)’  (p ,m)  (n,q) »  (P,q) 

Solutions  of  Eq  (2.1)  and  also  of  the  more  general  matrix 

equation 


m 

l  A  XB  -  Q  (2.2) 

i-1 
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will  be  found  through  use  of  matrix  methods,  tensor  products, 
and  spectral  decomposition  of  matrices. 


Necessary  and  Sufficient  Conditions 
for  the  Solution  of  Eg  (2.1) 

Roth  (Ref  38)  developed  his  results  for  square  matrices 
by  using  the  concept  of  similar  matrices.  Since  the  equation 
now  under  consideration  has  rectangular  component  matrices, 
the  restriction  that  the  matrices  be  similar  is  relaxed  and 
all  that  is  required  is  that  the  matrices  be  equivalent. 

Definition :  A  matrix  B  is  said  to  be  equivalent  to  a 
matrix  A  if  there  exist  nonsingular  matrices  P  and  Q  such  that 
B  -  PAQ. 


Theorem  2.1  (Necessary  Condition):  Let  X,_  denote 
-  (m,n) 


the  matrix  solution  of  AXB  +  CXD 


E  where  A,  . ,  B ,  . , 

(P  »m)  (n,q) 


^(p  m)  *  D(n  q)  *  an<*  ^(p  q)  '  t^ien  the  following  pair  of  matrices 


fA(p.m) 

E(P I 

rACp,m) 

°(p»q)‘ 

°(n,m) 

D(n,q)  ' 

°(n,m) 

D  .  (2-3^ 

U(n,q) 

equivalent. 

Proof : 

To  show  that  the 

above  matrices 

are  equivalent 

there  must  exist  two  nonsingular  matrices  P  and  Q  such  that. 


P 


(P+n,p+n) 


A(P,m)  E(p,q)l.0 

J  y  (m«-q  ,m+q) 

°(n,m)  D(n,q) 

jAp,m)  °(p  ,q)  | 

°(n,m)  D(n,q)J 


(2.4) 


5 


6 


Theorem  1  shows  that  a  necessary  condition  for  a  solu¬ 
tion  to  exist  is  that  the  matrices  A  and  D  from  Eq  (2.1)  can 
be  written  in  block  diagonal  form.  Similarly,  the  same  result 
can  be  shown  for  the  matrices  B  and  C  of  Eq  (2.1). 

Theorem  2.2  (Necessary  Condition):  Let  X^m  denote 

the  matrix  solution  of  AXB+CXD-E  where  A,  . ,  B ,  .  ,  C ,  .  , 

(p,m)  (n,q)  (p,m) 

D^n  ^  and  E^p  ^ ,  then  the  following  pair  of  matrices  are 
equivalent : 


Two  other  necessary  conditions  arise  if  in  Eq  (2.1) 
the  matrices  A  and  C  are  of  the  same  square  dimension  and  the 
matrices  B  and  D  are  also  of  the  same  square  dimension  but 
different  from  A  and  C.  These  conditions  arise  from  the  study 
of  pencils. 

Definition :  Let  A  anc  C  be  a  pair  of  square  matrices 
of  the  same  order  and  let  \  be  an  element  of  the  complex  num¬ 
bers,  then  A  ♦  XC  is  called  a  pencil. 

Definition:  A  pencil  is  considered  to  be  regular  if 
the  matrices  A  and  C  are  square  and  the  determinant  |A  ♦  AC | 
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is  not  identically  zero.  Otherwise,  the  pencil  is  singular 
(Ref  28:823) . 

Theorem  2.3:  If  A+AC  and  B  +  AD  are  regular  pencils, 
then  Eq  (2.1)  has  a  solution  if  and  only  if  the  following 
pair  of  matrices 


[(C-eA)'1 

(C-eA)  '^(B+eD)  *1' 

[(C-eA)*1  01 

*-  0 

-D(B+eD) " 1- 

Lo  -D(B+eD)  '1J 

are  similar,  where  there  exists  an  e,  a  complex  scalar,  such 
that  |-eA+C|  +  0  and  |B+eD|  +  0. 

Proof :  Since  A+AC  and  B+AD  are  regular  pencils,  the 
determinants  are  |A+AC|  /  0  and  |B+AD|  f  0.  This  implies  that 
| A+AC |  is  a  polynomial  in  A  and  has  at  most  n- zeros.  Hence, 
there  are  at  most  m  values  of  A  for  which  |A+AC|  vanishes. 

This  is  similarly  true  for  B+AD.  Choose  e  not  equal  to  any 
value  of  A  for  which  |A+AC|  ,  |B+AD|  vanishes  (Ref  8:824). 

Thus  e  is  a  scalar  such  that  | - e A+C |  f  0  and  | B+eD |  +  0,  which 
implies  (-eA+C) _1  and  (B+eD)'1  both  exist.  Thus,  the  following 
is  true: 

AXB  +  CXD  -  E  (2.13) 

AXB  +  eAXD  -  eAXD  +  CXD  ■  E  (2.13a) 

AX(B+eD)  +  (C-eA)XD  -  E 

(C-eA)'1AX(B+eD)  +  XD  -  (C-eA)*1E 

(C-eA) " 1AX  +  XD(B+eD) * 1  -  (C-eA) ’1E(B+eD)"1 

Eq  (2.13)  has  then  been  reduced  to  a  Liapanov  equation.  By 
the  results  of  Roth  (Ref  38:392),  the  matrices  in  Eq  (2.12) 
are  similar  which  completes  the  proof. 
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Theorem  2.4:  If  A+XC  and  B+XD  are  regular  pencils, 


then  Eq  (2.1)  has  a  solution  if  and  only  if  the  following 
pair  of  matrices 

f(A+eC)  ” 2C  (A+eC)  "1E(D-eB)*1‘|  (-(A+eC)'1  0-| 

*- 0  -D(B+eD)  * 1 J  ’  Lo  -D(B+eD)  1 J 


are  similar,  where  there  exists  an  e,  a  complex  scalar,  such 
that  |A+eC|  i  0  and  |D-eB|  f  0. 

Proof :  The  proof  of  Theorem  2.4  is  similar  to  the  proof 
of  Theorem  2.3.  The  major  difference  is  that  in  Eq  (2.13a),  the 
matrix  eCXB  should  be  used  instead  of  the  matrix  eAXD. 

Theorem  2.5  (Sufficient  Condition) :  Let 

AX  +  CXB*  XD  -  E  (2.14) 


where  B  is  a  nonsingular  matrix,  and  let 


A+XC  E* 

.0  -D+XB- 


A+XC  0 

.0  -D+XB 


(2. IS) 


be  a  pair  of  equivalent  matrices,  where  A+XC  and  D+XB  are 
regular  pencils  of  matrices.  Then  there  exists  a  solution  of 
Eq  (2.14). 

Proof :  By  the  results  of  Roth  (Ref  38:392),  and  since 
the  matrices  in  Eq  (2.15)  are  equivalent  with  elements  in  F(X) 
where  F  is  a  polynomial  domain  and  X  is  an  indeterminant,  then 
X (X )  and  Y(X)  are  matrices  with  elements  belonging  to  F(X). 
Hence  Eq  (2.14)  can  be  rewritten  as 


(A+XC)X(X)  -  Y (X) (D*XB)  -  E  (2.16) 
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Let 

x(x)  -  x0^vx2x2+xV-+xPxp*  p  -*’1 

(2.17) 

and 

let 

(2.18) 

Y (X)  -  Yq+XY1^X2Y2+X3Y3+.. .+XqYq,  q  <  n-1 

where  p  and  q  denote  the  number  of  matrices  with  elements  in  F. 
Since  XU)  and  Y(X)  are  of  the  same  dimension,  then  p  -  q. 

Thus  Eq  (2.16)  can  be  expressed  as 

(A.XC)  (Xq.XXj.xV-  •  •*X'V  (2  19) 

-  CY0tXYJ*X2Y2*...-*XPYp)(I>-XB)*E 

Equating  the  coefficients  of  like  powers  of  X  in  Eq  (2.19). 
the  following  P»2  equations  can  be  arrived  at: 


AXq 

AXX  ♦  CX0 

ax2  -  cxx 


YoD 

Y:D  *  YQB 
Y2D  +  YjB 


E 

0 

0 


\\  +  CXh-l*  YhD  *  Yh-1B 


(2.20) 


cx. 


YPB 


Multiplying  each  of  the  equations  of  Eq  (2.20)  by  I ,  B  D, 
(B_1D)2 ,  (B'W,...,  (B’ ^D)^+1  respectively  yields: 


AXq  "  E 

AX^B^D)  ♦  CX^B'b)  -  0 
AX2(B‘1D)2  ♦  CX1(B‘1D)2  ■  0 


(2.21) 
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(2.21) 


AXh(B'1D)h  +  CXh.1(B*1D)h  -  0 

CXpCB^D)15*1  -  0 

Adding  the  equations  of  Eq  (2.21)  and  factoring  yields: 

A[X0-*-X1(B'1D)+X2(B'1D)2*. .  .♦Xp(B*1D)p]  ♦ 

C[X0(B‘1D)-*-X1(B'1D)  2+X2(B'1D)3+. .  .+X  (B‘1D)P+1]  -  E  (2.22) 

which  implies 

A[X0+X1(B'1D)+X2(B'1D)2^.. .♦Xp(B*1D)p]  ♦ 

C[X0*X1(B’1D)+X2(B'1D)2  ♦. . .♦Xp(B‘1D)p] (B_1D)  -  E  (2.23) 

Hence,  a  solution  of 

AX  ♦  CXB* XD  -  E  (2.24) 

exists  and  the  theorem  is  complete. 

Conditions  for  the  Solution 

n 

of  the  Matrix  Equation  Z  A.XB.-C 

i»l  1  1 

Conditions  that  are  necessary  and  sufficient  for  solu¬ 
tions  of  the  general  linear  matrix  equation 

n 

Z  A.XB,  -  C  (2.25) 

i-1  1  x 

will  be  based  upon  the  ideas  of  a  spectral  decomposition  of  a 
matrix  into  a  representation  that  uses  idempotent  and  nil- 
potent  matrices. 
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Definition:  Let  N  be  a  square  matrix  with  elements  in 
a  field  F.  If  *  N,  N  is  said  to  be  idempotent. 

Definition :  Let  N  be  a  square  matrix  with  elements  in 
a  field  F.  If  there  exists  a  positive  integer  m  such  that 
Nm  *  0,  N  is  said  to  be  nilpotent. 

Definition:  Let  A  and  B  be  two  idempotent  matrices. 

Then  if  AB  *  BA  =  0,  A  and  B  are  said  to  be  orthogonally  idem- 
potent. 

Orthogonally  idempotent  matrices  are  generated  by 

spectrally  decomposing  a  matrix.  Matrices  that  are  associated 

with  different  eigenvalues  are  orthogonal. 

n 

Definition:  Let  A  *  Z  X^E..  The  representation  of 

i»l  1  1 

A  where  each  X^  is  an  eigenvalue  of  A  and  each  is  an  idem- 
potent  matrix  associated  with  the  X^  is  called  a  spectral 
decomposition. 

From  a  theorem  of  Rosenblum  (Ref  36:268),  there  exists 
the  property  that  the  sum  of  the  orthogonal  idempotent  matrices 
equals  the  identity  matrix,  I. 

As  stated  earlier,  a  considerable  amount  of  work  has 
been  done  to  find  the  solution  of  equations  of  the  type 

BX  -  XA  -  Q  (2.26) 

Most  of  this  effort  has  been  directed  towards  those  cases  in 
which  the  matrices  A  and  B  have  been  square  and  of  the  same 
dimension.  Rosenblum  (Ref  36)  has  derived  both  necessary  and 
sufficient  conditions  for  this  case.  The  next  theorem  extends 
his  work  to  the  case  where  the  matrices  are  of  different 
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dimensions . 


Theorem  2.6:  Let  A  be  a  square  matrix  of  dimension  m. 


m 


such  that  A 


£  a.E.  and  B  is  a  square  matrix  of  dimension  n, 
j-1  J  J 
n 

such  that  B  *  £  b.F,,  where  the  E.'s  and  F.  's  each  form  dis- 

K  K  3  K 

tinct  sets  of  orthogonal  idempotent  matrices.  A  necessary  and 
sufficient  condition  that  the  matrix  equation  Eq  (2.26)  have  a 


solution  X 


(nfm) 


is  that  whenever  for  some  pair  of  indices  s  and 


r,  as  *  br,  that  is  the  characteristic  roots  are  equal,  then 
FrQEs  "  °(n,m) * 

Proof :  To  show  necessity,  suppose  X. 


k(n,m) 

of  Eq  (2.26)  such  that  ag  *  br.  Then  Eq  (2.26)  implies 
n 


is  a  solution 


m 


£  (b,F.)X  -  X(  £  a.E.)  -  Q 
k-1  K  K  j-1  J  J 


(2.27) 


which  can  be  rewritten  as 

n 
£ 

k-1 


m 


£  b.F.X  -  £  a.XE. 

K  K  j-1  ]  3 


(2.28) 


Multiplying  from  the  left  in  a  termwise  fashion  by  Fr  and  at 
the  same  time  multiplying  from  the  right  in  a  termwise  fashion 
by  Eg  yields: 


brFrXEs  '  asFrXEs  *  FnQEs  C2*29) 

since  the  Es's  and  Fr's  are  orthogonally  idempotent.  Factoring 
Eq  (2.29)  yields: 


(br'as)FrXEs  "  FrQEs  (2*30) 

But  by  hypothesis  br  -  ag ,  thus  the  left  side  of  Eq  (2.30) 


13 


becomes  0^n  .  This  then  implies 

F  QE  -  0,  x 
rx  s  (n,m) 

To  show  sufficiency,  the  following  convention  will  be  used. 

If  as  ■  br,  then  FrQE$  *  0  and  ®  •  0  «  0.  Thus  the  expression 


'v*r*qEi 


(2.31) 

has  meaning  for  all  j  and  k.  Next  let  V  be  defined  as  follows 


m  n 

V  -  E  E 


FVQE, 


(2.32) 


k-1  j-1  bk'aj  k  j 

If  V  is  a  solution  of  Eq  (2.26),  replacing  X  by  V  should  result 
in  Q.  Hence, 


m  n 

BV-VA  -BE  E 
k 


«»  »  i  m  n  « 

ii  A  F*QEj '  w  j-i  ^7  FkQEj A 

n  IB FkQE  -  FkQEiA| 

1  j-1  l  ®k"aj  ) 


m 

-  E 
k- 


m 


m  „  *  Fk«Ej  *  <\V 

E  E  -  M  1 


k-1  j-1  ^k  '  3j 

;  j  bkFkQEi  •  aiFkQEi 
k-1  j-1  °k'aj 


m  n 

"  k-i  j-i  FkQEj 


m 


n 


■(  E  F.)Q  (  E  E.) 
k-1  K  j-1  J 

-  Q 
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Therefore  V  is  a  solution  of  Eq  (2.26)  and  the  proof  is  com¬ 


plete. 


Example  2.1:  The  technique  derived  in  Theorem  2.6 


will  now  be  used  to  solve  for  the  matrix  X  in  the  following 


equation: 


2  1  10  0  1 

X  -  X  0  2  1  - 

0  3j  003  4 


2  3 


2  -2 


(2.33) 


The  representation  of  A  is 


A  -  a1E1  ♦  a2E2  ♦  a3E3 


and  B  is 


B  «  b1F1  b2F2 


In  terms  of  the  values  from  Eq  (2.33),  these  become 


-10  0 
A  -  1  0  0  0 

-0  0  0 


B  -  2 


10  0-1  TO 

0  0  0  +  2  0- 
0  0  0  J  L0 

i  -i  I  To  f 

0  0  +  3  0  1 


o  o  o-i  ro  o  o 

0  -1  -1  +  3  0  0  1 

0  0  0  J  Lo  0  1 


(2.34) 


From  Theorem  2.6  the  solution  will  look  like  Eq  (2.32),  thus 


2  3  F.QE. 

c-1  j»lbk'aj 


FlQEi  FlQE3  F2QEi  F2QE: 

L*  -  .  f  t  f—  ♦  ■■  ■  ... 


Vs!  bl'a3  b2'al 


*2'a2 
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To  extend  the  results  of  Theorem  2.6  use  will  be  made 
of  the  fact  that  if  two  matrices  commute,  then  the  associated 
idempotent  representation  will  also  commute.  The  extended 
theorem  is  as  follows. 

Theorem  2.7 :  Let  A  and  C  be  square  matrices  of  dimen¬ 
sion  m,  such  that  A  and  C  can  each  be  expressed  as  a  sum  of 
products  of  eigenvalues  and  orthogonal  idempotent  matrices. 
Let  B  and  D  be  square  matrices  of  dimension  n,  such  that  B 
and  D  can  each  be  expressed  as  a  sum  of  product  of  eigenvalue 
and  orthogonal  idempotent  matrices.  Also,  let  AC  ■  CA  and 
BD  ■  DB.  Then  a  necessary  and  sufficient  condition  that  the 
matrix  equation 

BXC  +  DXA  ■  Q  (2.35) 

has  a  solution  X.  .  is  that  whenever  for  some  set  of 
(n,m) 

indices  {r,s,v,p}  the  following  holds:  csbv+ardp  *  °»  that 
the  products  of  the  characteristic  roots  sum  to  zero,  then 


H  F  QG  E  *  0,  , 

p  vy  s  r  (n,m) 


(2.36) 


Proof:  To  show  necessity,  suppose  X,  .  is  a  solution 
-  (n,m) 

of  Eq  (2.35)  such  that  csbv+ar<ip  -  0,  then  Eq  (2.35)  implies 

(  l  b.F.)X(  Z  c.G.)M  ?  d,H.  )X(  Z  a.E.)-  Q 
k-1  K  K  j«l  3  3  1-1  1  1  i-1  1  1 

Multiplying  from  the  left  by  Fy  in  a  termwise  fashion  and  also 
multiplying  from  the  right  by  Gg  in  a  termwise  fashion  yields: 


CSbvFvXGs 


m 


♦  F 


'  FvQGs  (2-37) 


Using  the  fact  that  if  two  matrices  commute,  then  their  idem 
potent  matrices  commute,  Eq  (2.37)  can  be  written  as 


csbvFvXGs  *  j,dlHl  FvXGs  ■  Fv«Gs 


Now  multiply  from  the  left  by  Hp  in  a  termwise  fashion  and 
multiply  from  the  right  by  Ef  in  a  termwise  fashion  which 
results  in 


'MWr  *  ardpHpFvXGsEr  *  HpFvQGsEr  (2.J8) 

Eq  (2.38)  then  implies 


(csVardp)HpFvXGsEr  "  HpFvQGsEr  C2-39) 

By  hypothesis  csby*ardp-  0(n>m)  and  thus  the  left  hand  side 
of  Eq  (2.39)  is  equal  to  0^n  ,  which  implies  that 


H  F  QG  E 
p  vx  s  r 


(n,m) 
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To  show  sufficiency,  use  will  be  made  of  the  convention  that 
whenever  cgbv  ■  *ardp,  then  HpFvXGsEr  "  0  and  «  *0  -  0,  thus 
the  expression 


will  have  meaning.  Define 


V 


m  n  m 
III 
i-1  k-1  j-1 


n  H1FkQGiEi 
1-1  Cjbk+aidl 


(2.40) 


and  let  V  be  a  solution  of  Eq  (2.35).  Thus  substituting  the 
spectral  representation  for  the  matrices  A,B,C,  and  D  yields: 


n 


BVODVA 


X-l 


'X‘X 


m  n  m  n  H.F.QG-E.  m 

l  l  l  l  — .  *■  1 1  l  c  G 

i-1  k-1  j-1  1-1  Cjbk  aidl  y- 1  11  W 


n  m  n  m 

l  d_H_  l  l  l  l 


n  H1FtQG.Ei 


k;l  1;1  Cjbk*aidl  ♦-! 


m 

l  a.E, 


n 


m 


m  n  m 
III 
i-1  k-1  j-1  1-1 


Wi 


n 


m 


m  n  m 

l  l  l  l 

i-1  k-1  j-1  1-1 


_  (  l  d  H  )H.F.QG.E. (  l  a.E.) 
r  -? 1 - . -  .  _ ili _  (  ? 


cjbk**iai 


(2.41) 


Using  the  properties  of  orthogonal  idempotent  matrices  and 
commutativity,  Eq  (2.41)  simplifies  to 


m 

y 

n 

y 

m 

y 

n 

y 

CibkHlFkQGiEi 

is 

i-1 

Lk 

k-1 

j-1 

1-1 

cjbk*  aidl 

m 

r 

n 

r 

m 

y 

n 

r 

aidlHlFkQGiEi 

Lt 

i-1 

fa 

k-1 

j-i 

fa 

1-1 

cibk  *  aA 
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ra  n  m  n  (c.b,  +  a.  d.  )H. F.QG .  E . 

-  IEEE  — 3— i - -  *  "  -- ,  - — LJ 

i-1  k-1  j-1  1-1  cjbk  *  aidl 


m  n  m  n 
IEEE  H, F.  QG . E • 
i-1  k-1  j-1  1-1  1  K  J  1 

m  n  m  n 

EH.  E  F,  Q  E  G.  EE. 

1-1  1  k-1  k  j-1  3  i=1  i 


Hence  V  is  a  solution  of  Eq  (2.35)  and  the  theorem  is  proved. 

In  the  more  general  case,  solutions  of  equations  of  the 

type 


E  A • XB •  -  Q 
i-1  1  1 


(2.42) 


where  the  matrices  A  and  B  are  square  and  of  different  dimen¬ 
sion  can  be  found.  To  solve  this  type  of  equation  the 
following  restrictions  would  have  to  hold: 

i)  (A^>  would  have  to  be  a  commutative  set, 

ii)  {B^}  would  have  to  be  a  commutative  set,  and 

iii)  n  m 

E  E  a.b.  =  0 
i-1  j-1  1  3 

If  these  three  restrictions  hold,  then  the  implication  is  that 


ii  in 

(  E  E.)  Q  (  E  F.) 
i-1  1  j-1  3 


Example  2.2:  As  an  example  of  Theorem  2.7,  consider 


the  equation 


BXA  +  BXA  -  Q 


(2.43) 


where  the  B  and  A  matrices  are  as  defined  in  Example  2.1.  Let 
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Then  the  solution  for  X  would  be: 


X 


3  2  3 

EE  E 
i-1  k-1  j-1 


2 

E 

1-1 


Wi 

cjbk*aidi 


The  summation  would  then  involve  summing  32  terms.  This 
process  could  easily  be  adapted  for  a  computer  solution. 

Not  all  matrices  may  be  decomposed  into  a  representa¬ 
tion  that  involves  only  idempotent  matrices.  However,  a 
matrix  of  this  type  may  still  be  decomposed  by  using  both 
idempotent  matrices  and  nilpotent  matrices.  If  E  is  an  idem 
potent  matrix,  then  let  E  be  its  associated  nilpotent  matrix 
Two  properties  of  nilpotent  matrices  are: 


i)  E.E.  -  E.  -  E.E. 

ii)  E.Ej  -  E.E.  -  0  (i  i  j)  (2.44) 


Generalizing  Theorem  2.6  to  this  more  inclusive  case 
results  in  Theorem  2.8. 

Theorem  2.8:  Let  A  be  a  square  matrix  of  dimension  m 
m<m 

such  that  A  -  E  a.E.+E.  and  let  B  be  a  square  matrix  of 
i-1  11  1 

n<n 

dimension  n,  such  that  B  -  I  b.F.+E..  The  sets  { E . } ,  (F.) 

j-1  J  J  J  1  J 

are  complete  sets  of  principal  idempotent  matrices  and  the 

sets  {E^},  {Fj }  are  complete  sets  of  nilpotent  matrices 


associated  with  A  and  B  respectively.  If  QE.^ 


EjQ  and 


XF^  ■  FjX  for  all  i  and  j,  then  a  necessary  and  sufficient 
condition  that  BX  -  XA  ■  Q  has  a  solution  X.  *  is  that  for 


‘(n,m) 


some  pair  of  integers  r  and  s,  a  -  b  ,  then  B  QA  -  0r  .  . 

5  i  rs  i  n  #m  j 

Proof:  To  show  necessity,  let  X,..  be  a  solution  of 
-  (n,m) 


BX  -  XA  -  Q 

such  that  as  -bf .  then  Eq  (2.26)  implies 


(2.26) 


it  in 

[  I  (b,F  >F,)]  X  -  X  [  t  (a ,E, ♦F. ) ]  -  Q 
j-1  J  J  J  i«l  ill 

n<n  n<n  _  m<m  m<m 

r  b.F.x  ♦  r  f.x  -  r  a  xn  -  r  xe  -  q 

j-1  J  3  j-1  J  i-1  11  i-1  1 

Recalling  the  properties  of  nilpotent  matrices  that  are  stated 
in  Eq  (2.44)  and  multiplying  from  the  left  by  Fr  in  a  termwise 
fashion  yields 

_  m'-m  m<m  _ 

W  *  V  -  J,  -  J,  Fr“i  - 

Multiplying  from  the  right  by  E^  in  a  termwise  fashion  yields 

brFrXEs  ♦  FrXBs  •  asFrXBs  -  F,.XES  -  F'rQEs  (2.45) 

Regrouping  and  simplifying  Eq  (2.45)  becomes 

(b  *  a  )  F  XE  ♦  F  (F  X  -  XE1EC  -  F  QE  (2.46) 

r  s  r  s  r  r  s  s  r^  s  v 

But  by  hypothesis  FrX  -  XES  which  implies  Eq  (2.46)  is 

(br  -«s)FrXEs  -  FrQEs  (2.47) 


Again,  by  hypothesis  bf  -  as  and  hence  the  left  side  of 
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Eq  (2.47)  is  0^n  which  implies  that 


F  0£c  “  0 ,  - 

r  s  (in  t  n ) 

To  show  sufficiency  the  convention  will  be  used  that  whenever 
as  ■  br,  then  FrQEg  “  0  and  «  •  0  -  0 .  Thus  the  expression 

Fk«Ei 

5r*T 

will  have  meaning  for  all  values  of  j  and  k. 

Let  V  be  a  solution  of  Eq  (2.26)  where 


m<m  n<n 

r  ▼ 


i 


r  Z  r--.-  -F.QE . 
i-1  k-1  ”k’ai  ^  1 


(2.48) 


Then  the  left  side  of  Eq  (2.26)  is 


BV  -  VA 


m<m  n<n  F.QE.  m>-m  n<n  F.QE. 

b(  r  r  i)  -  (  i  r  J^r±)A 

i-l  k-1  bk  ai  i-1  k-1  bk  ai 

in<m  n<n  3F.QE.  in<m  n<n  F,  QF.  .A 

I  T  r-± i  -  I  I  jf  1 

i-l  k-1  bk‘ai  i-l  k-1  bk‘ai 


(2.49) 


In  Eq  (2.49)  substitute  the  spectral  decompositions  for  A  and 
B  to  get 

m<m  _  n<n  _ 

5<m  n<n  J  <bXFX*Fx'FK«Ei  i<„  n<n  1  <%F„*Eu> Fk<5El 

*  r  r  ■  -  -  -  k1  ■  - r  r  I"1  ?  “  - 

i-l  k-1  bk"ai  k-1  t-1  bk'“l 


m<m  n<n  (b^Fj^F^QE^  m<m  n<n  F^Qta.Ej+E^) 

1-1  Ji  ill  Ji  “'VN 


Adding  and  then  factoring  out  the  common  terms  yields: 


(2.50) 


m<m  n<n  (bkFk+Fk)QE  -F^a^+E.) 
i-l  k-1  bv'ai 


Expanding  the  numerator  of  Eq  (2.50)  changes  the  equation  to 


.  Y  Y  bkFkQEi^kQEi-aiFkQEi-FkqEi 

i-l  k-1  bk*ai 


(2.51) 


Applying  the  hypothesis  and  the  properties  of  idempotent  and 
nilpotent  matrices  to  Eq  (2.51),  this  equation  simplifies  to: 


Ji  Ji  — V3! - 

m<m  n<n 

i-l  k-1  bk'ai 


m<m  n<n 
-  I  l 
i-l  k-1 


FkQEi 


n<n 

r  f 

k-1  ' 


m<m 

r 


i-l 


Hence  V  is  a  solution  of  BX-XA-Q,  and  the  theorem  is  complete. 

Example:  As  an  example  of  the  procedures  demonstrated 
in  Theorem  2.8,  let  the  matrices  of  Eq  (2.26)  he  as  follows: 


2  -1 
3  3 

«-4 


1  H 

3  -2 
1  0J 


(Ref  4:186)  and  B 


l! :] 


(Ref  29:277)  (2.52) 


[; ;  ::i 


Thus  Eq  (2.26)  becomes 
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(2.53) 


C 


f  2  -1 
3  3 

U  1 


2  -2' 
1  -1. 


The  decompositions  of  A  and  B  are: 


A  ■  ♦  Ej  ♦  *  ^2 


B  -  b1F1  ♦  b2F2 


In  terms  of  the  specific  A  and  B  of  Eq  (2.52) 


[j[-l  5  -l]U  (j[lO  10  -loll 
lH-3  3  lJi  ‘4LlO  10  -lOJ/ 


i  f  2  -  2 

Ti 10  10  -10  II*  3  i  1  -1 
T,10  10  -10J/  -3 


►  ' 


V 

111*0 
3. 


B  -  -1 


[.:  :])•  •(![:  ai 


(2.54) 


(2.55) 


From  Theorem  2.8  the  value  of  X  that  is  being  sought  can  be 
represented  as  Eq  (2.48)  as  follows 


X 


1<2  0<2 
I  I 
i-1  k-1 


Expanding  Eq  (2.56)  implies 

^  FlQEl  +  FlQE2  +  F2QEl 
h j  *8j  b^-a2  b 2  *  a ^ 


(2.56) 


(2.57) 


For  all  other  choices  of  i  and  k  the  difference  b^-a^  goes 
to  zero  and,  hence,  by  the  convention  in  Theorem  2.8  the 
quantity  goes  to  zero.  Making  the  substitutions  into  Eq  (2.57) 
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-++*irx  :::k  hi 

•  -+++L:  x :  j  n 

•  IK  I  -UK  I  :il  “•”> 

,  fO  0  01  ,  r-4  4  -41  .  T12  12  -121 

10  lo  0  oJ  ^  M  .4  4J  16  L12  12  - 1 2 J 

■•ii;  ;::i 

The  last  theorem  in  this  section  generalizes  the  results 
of  Theorem  2.7. 

Theorem  2.9:  Let  A  and  C  be  square  matrices  of  dimen- 
m<m  m<m 

sion  m,  such  that  A  ■  I  a.E.+E.  and  C  -  I  c.G. and  let 


ill 


J  J  J 


B  and  D  be  square  matrices  of  dimention  n,  such  that  B  *  E  b.F 

k-1  K 

n<n  _ 

and  D  ■  E  d.H.+H.,  where  the  sets  (E. },  {G.}  ,  {F,}  ,  and 

1«1  111  1  j  K 

{H^}  are  complete  collections  of  principal  idempotent  matrices 
and  the  sets  {E^>  ,  {G ^ }  ,  {F^}  ,  and  (H1 >  are  complete  collec¬ 
tions  of  nilpotent  matrices.  Then  a  necessary  and  sufficient 
condition  that  the  matrix  equation 


BXC  ♦  DXA  -  Q 


(2.3  5) 


have  a  solution  X,  *  is  that  whenever  for  some  set  of  in- 

(n,m) 

dices  (r,s,v,p)  where  HvFr  and  GgEp  form  nonsingular  matrices, 


1 


and 


SV*  *  br'5Cs  *  "pM  *  dvV  *  Fr«Cs  *  RvQFp  '  0 

implies  that  X  is  also  a  solution  of 
(brVdvVcsVapRv)X  +  x(brVdvV  *  W  RvXEp  "  Q 


Proof :  Let  X 


(n,m) 


be  a  solution  of  Eq  (2.35)  such  that 


1  n<n 

m<m 

\ 

r 

'k«l 

(HW 

X 

F 

j-1 

n<n 

F 

1-1 

(djHj.Rj) 

X 

1 

in<m 

F 

i-l 

(0iVFi> 

(2.59) 


Expanding  Eq  (2.59)  yields 


In<n  \  /m<m  \  / n<n  \  I m<m_ 


n<n  \  /m<m  /n<n  \  /m<m 

r  F  X  r  c.G.  ♦  f  f.  X  f  ff 
k-l  kl  lj-1  J  J  '  k-1  kl  'j-l  J 


n<n  1  fm<m  /n<n 

F  d.H.  X  Z  a . E .  ♦  F  d.H. |x 

1-1  1  ll  \i-i  1  M  \ i-i  1  1 


(2.60) 


m<m 

F  F, 


i-l 


n<n  \  fm<m  \  fn<n  /m<m 
♦  F  R.  X  F  a.E.  U  FH.  X  F  F. 
1-1  »i-l  1  11  '1-1  'i-l  1 


Multiplying  on  the  left  by  Fr  and  Hy  in  that  order  in  a  term¬ 


wise  fashion  and  then  multiplying  on  the  right  by  Gs  and  E 


in  that  order  in  a  termwise  fashion  making  use  of  commutativity 
and  the  idempote^t  properties  of  these  matrices  yields  from 


Eq  (2.60) 
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, 


HFQGE  -bcHFXGE  ♦  b  H  F  XU  E 
v  ry  s  p  r  s  v  r  s  p  rv  rs{ 

♦CHFXGE+HFXFE  ♦daHFXGE 
sv  rsp  v  rsp  vpv  rsp 

+  d  H  F  XG  F  ♦  a„R  FXG  En  «■  FT  F  XG  F 
vvr  sp  pvr  sp  vr  sp 


(2.61) 


Using  the  properties  of  nilpotent  matrices  and  factoring, 

Eq  (2.61)  can  be  reduced  to 

wVp  ■  VrKwysVap'V* 

*  WrVdvV  *  VSs  *  (2.62) 


Since  H  .F  and  G.E  are  nonsingular  matrices,  then  their  in- 
v  r  s  p  ’ 

verses  exist  and  Eq  (2.62)  can  be  reduced  to 

Q  -  (brcs+dvap+csVapfTv)X+X(brjrs+dvEp)+FrX{Ts+ITvXFp 

Which  implies  X  is  also  a  solution  of  this  equation  and 
necessity  has  been  shown.  To  show  sufficiency  the  convention 
will  be  used  that  whenever  brcg  ♦  dvap  ■  then  HvFrXGsEp  * 
and  «®  •  0  ■  0,  then  the  expression 


HvFrXGsEp 

b  c  «-d  a 
r  s  v  p 


(2.63) 


will  be  well  defined.  Let  V  be  a  solution  to  Eq  (2.63)  where 
m<m  m<m  n<n  n<n  H„F  OG 

v  -  r  r  r  r 

p*l  v»l  r-1  s*l  Drcs 

Then 


L 
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n<n 


BVC  ♦  DVA  -  I  (b.F.^F.) 

'  k-1  K  K  /  i p 


Hm<m  m<m  n<n  n<n  H,  F  QG  E„  \ 

i  r  t  r  J-O-* 

p-1  v-1  r-1  s-1  brcs  dvap  • 


m<m 


n<n 


•ijF1(cjGj‘ffjr[1F1(<iiH1*Ri 


m<m  m<m  n<n  n<n  H  F  OG„E„  \  Jm<m 

r  r  r  r  rv--CXE  r  (a.E.+E.; 
p-1  v-1  r-1  s-1  rcs  Vp J«i*l  11  1 


(2.65) 


Simplifying  Eq  (2.65)  by  multiplying  and  making  use  of  the 
properties  of  idempotent  and  nilpotent  matrices  yields: 


BVC  ♦  DVA 


m<m  m<m  n<n  n<n|(c  b  *d,  a„)  H 

-  r  r  r  r  E-.s--r 

P"1  v* 1  r-1  s-1^  brcs+dvap 


F  QG  E 
v  rx  s  p 


HvFr(c  F  Q.b  QG  ..  H  0.d,E  *F  OG  *H,QE  )G  E  1 

■g  '.d?  p  '  p  <2-66> 

r  s  v  p  J 

Utilizing  the  convention,  Eq  (2.66)  simplifies  to 

m<m  m<m  n<n  n<n 
-IEEE  »  FOG  E 
p-1  v- 1  r-1  s-1  v  r  s  p 


Which  completes  the  proof  of  this  theorem. 

Generalizing  these  results  to  equations  of  the  type 


E  A.XB.  -  Q 
i-1  1  1 


(2.67) 


where 


m 


m 


jfl(aijAiJ*AlJ>  and  Bt  ■ 
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with  {Ajj}  ,  {B.^}  forming  sets  of  complete  idempotent  matrices 

and  forming  complete  sets  of  nilpotent  matrices. 

A  necessary  and  sufficient  condition  that  the  Eq  (2.67)  have 

a  solution  X,  »  is  that  whenever  for  some  set  of  indices  the 
(n,m) 


n  m 


n  m 


n  m 


n  II  A.  .  and  n  II  B..  have  inverses  and  E  Z  b.  .X.  .0 
i-1  j-1  1J  i-1  j-1  i-1  j-i 


n  m 

♦  Z  Z  .Q®.  .  »  0,  then  X 
i-1  j-1  ^  XJ 


(n,m) 


n  m 


n  m 


is  also  a  solution  of 


n  m 


(2  H  ai.bi.  ♦  Z  Z  b..A..)X  ♦  X(  E  E  a..B.) 
i-1  j-1  1J  i-1  j-1  1J  1J  i-1  j-l  XJ 

m  m  n  m 

+  E  E  (  n  n  X..XB\.)  -  Q. 
i-1  j-1  i-1  j-1 

An  alternate  solution  to  Eq  (2.35)  exists  if  either  pair  of 
matrices  B,C  or  A,D  are  nonsingular.  If  this  condition  exists 
then  the  equation  to  be  solved  is 

X  +  B" 1DXAC* 1  -  B" ^QC* 1 


D'HxCA*1  ♦  X  -  D^QA'1 


Spectral  Decomposition  of 
Rectangular  Matrices 


In  this  section,  the  results  of  the  last  section  will 
be  generalized  to  solutions  of  matrix  equations  in  which  the 
coefficients  are  rectangular  matrices.  In  solving  these  equa¬ 
tions  use  will  be  made  of  the  following  theorem. 

Theorem  2.10:  (Rao  and  Mitra  (Ref  14:38))  Any  m  x  n 
matrix  A  can  be  written  as: 
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m 

A  «  Z  a • U .  (2.68) 

i-1  1  1 

2 

where  a^,  i«l,  2,  u  are  the  distinct  nonnull  eigenvalues 

of  A*A  and  the  matrices 

IT  -  a^Afl*  (A*A-a?I)  { (A*A-a?I)  (A*A-c^I)]  (2.69) 

satisfy 

UiUiUi*  UiVi 

UtUJ  -  0  ,  U*U  =  0  v i  +  j  (2.70) 

2 

(a^  is  taken  to  be  the  positive  square  root  of  ou  which  is 
real  and  positive  since  A*A  is  hermitian  and  non-negative 
definite . ) 

For  a  proof  of  the  above  theorem,  the  reader  is  directed 
to  the  source  cited.  The  notation  used  above  is  defined  as 
follows:  a  matrix  denoted  by  A*  is  the  conjugate  transpose 
of  the  matrix  A;  and  a  matrix  A  is  the  generalized  inverse 
of  the  matrix  A. 

Example  2.3:  As  an  example  of  Theorem  2.10  consider 

First  the  nonnull 
and  the  product 

A*  A 

Therefore , 


(2,3) 


fi  1  1] 

ll  1  lJ 


eigenvalues  must  be  found,  thus  A* 


(2.71) 


i:  a 


n  H  I1  '  ‘l-li  i  il 

Ll  1J  Li  i  i J  L2  2  2J 


(2.72) 
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\2  2  21  rl  0  01 

A*A  -  XI  -  2  2  2  -AO  1  0 

1-2  2  2J  LO  0  lJ 

F2 - A  2  2  1 

-  2  2-A  2 

12  2  2-X-l 


-X3  ♦  6X2 


(2.73) 


-  X*(X-6) 

Setting  this  last  equation  equal  to  zero  and  solving  implies 

X  -  0  or  X  -  6 

2 

Since  the  or  are  the  nonnull  values  of  X,  the  only  X  that  can 

2  2 

be  set  equal  to  a.  is  the  value  of  6.  Thus  a7  *  6  and  hence 


A  -  61L 


(2.74) 


To  find  the  matrix  Uj ,  Eq  (2.69)  must  be  solved.  Making  the 


substitution  for  or  yields 


U!  "  7gAn-(A*A-6I){(A*A-6I)2}'  (A*A-6I)]  (2.75) 


A*A- 6 1 


f2  2  21  f6  0  01  f-4  2  2l 

2  2  2  -  0  6  0  -  2  -4  2 

1-2  2  2  J  L0  0  6  J  L  2  2  -4  J 


(2.76) 


with 


{ (A*A-6I) 


2  r 24 

-12 

-121 

2  -  -12 

24 

~ 1 2  l 

L- 12 

-12 

24  J 

r  i/6 

-1/12 

-1/12 

-1/12 

1/6 

-1/12 

L-l/12 

-1/12 

1/6 

(2.77) 


(2.78) 


Substituting  Eqs  (2. 76) ,  (2. 77)  and  (2.78)  into  Eq  (2.75) 
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i 


0 

1 

0 


2 

-4 

2 


21  r  1/6  -1/12  -1/121  r-4 

2  -1/12  1/6  -1/12  2 

-  4-1  L- 1/12  -1/12  1/6  J  L  2 


2 

-4 

2 


Thus  A  can  be  decomposed  into  a  summation  of  products  of 
scalers  times  matrices,  where  the  scalers  are  determined  by 
the  eigenvalues  of  A*A. 

Definition  (Ref  35:20):  Let  A  be  an  m  x  n  matrix  of 
arbitrary  rank.  A  generalized  inverse  of  A  is  an  n  x  m  matrix 
G  such  that  x  -  Gy  is  a  solution  of  Ax  =  y  for  any  y  which 
makes  the  equation  consistent.  One  of  the  important  properties 
of  generalized  inverses  is  found  in  the  next  lemma. 

Lemma  2 . 1  (Rao  and  Mitra  (Ref  35:20)):  A  exist  if  and 
only  if  AA'A  -  A. 

Example  2.4:  Let  A  be  given  as  in  Eq  (2.71).  To  find 
the  generalized  inverse  of  A  the  procedure  outlined  by  Noble 
(Ref  30:339-341)  will  be  followed.  Let  A  be  partitioned  as 
follows : 


A 


(2.79) 
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then 


A11  "  [11  »  Ai2  "[111.  A21  -  [1]  ,  and  Au  =  [1  1]  (2.80) 


From  the  matrices 

in  Eq  (2, 

.80)  , 

the  matrices  Q, 

B,  and  C  can 

be  generated. 

Q  - 

a  *  1  a  i 

a11a12 

■  [1] 

[1  1]  -  [1  1] 

B  - 

[a11]  • 

n 

(2.81) 

LA12  J 

Li  J 

C  - 

[I  Ql  - 

[1  1 

1] 

A  formulation  of  the  generalized  inverse  is  then  given  by  the 
equation 

T  T  _  1  T*  _  1  T 

(2.82) 


CT(CCT)*1(BTB)"1BT 


where  the  superscript  T  indicates  the  transpose  matrix. 
Making  the  appropriate  substitutions 

\-l 


liii"  * » w  i.>  "Ei 

in 

*  o  a 


{[i  i  i] 

(1/3)  (1/2)  [1  1] 


[1  1] 


(2.83) 


Theorem  2.11  (Rao  and  Mitra  (Ref  35:24):  A  necessary 
and  sufficient  condition  for  the  equation  AXB  -  C  to  have  a 
solution  is  that 


AA'CB'B  -  C 


(2.84) 
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in  which  case  the  general  solution  is 


X  *  A'CB'+Z-A'AZBB'  (2.85) 

where  Z  is  an  arbitrary  matrix. 

Proof :  Let  there  exist  a  matrix  X  such  that  AXB  =  C. 

Then: 

AA'CB'B  =  aa'axbb'b 
=  AXB 
=  C 

Thus  necessity  follows.  Sufficiency  is  obvious  since  ACB* 
is  clearly  a  solution. 

Example  2.5:  Let  AXB  *  C  where  A^2  ,  B^  and 

4)  are  defined  as  follows: 

ri  l  n  ri  l  o  -1*1  (3  2  -l  oi 

A  =  ,  B  =  2  3  -1  -4  ,  and  C  = 

Ll  1  lJ  I3  4  -1  -5J  U  0  1  1J 

then  solve  for  the  matrix  X. 

According  to  Theorem  2.11,  all  that  is  required  to 
solve  for  X  is  to  set  X  equal  to  the  result  of  Eq  (2.85). 
Doing  this  will  generate  a  family  of  solutions  dependent  on 
the  choice  of  the  matrix  Z.  For  purposes  of  this  example, 
let  Z  be  equal  to  the  zero  matrix,  then  X  =  A’CB".  a"  is  as 
found  in  Example  2.4,  Eq  (2.83). 

To  find  the  generalized  inverse  of  B,  the  same  proce¬ 
dure  will  be  followed.  Checking  the  rank  of  B,  it  is  easily 
shown  that  p(B)  ■  2,  and  thus  the  partition  of  B  is 
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X 


2  -1 
0  1 


6 

-1 

-5 

-4 


3’ 

1 

2 

1-1 


which  when  simplified  yields: 


X 


f-12  6 

(1/9)  12  6 

112  6 


The  next  theorem  generalizes  the  conclusions  of  the  last 
theorem. 

Theorem  2.12:  A  necessary  and  sufficient  condition  for 
the  equation 

AXC  ♦  DXB  -  Q  (2.89) 

to  have  a  solution  is  that  RR*WL*L  -  W,  where  W  “  RR*QL*L 
and  R  and  L  are  spectrally  decomposed  matrices  multiplied 
from  the  right  and  left  respectively.  Then  a  general  solution 
of  Eq  (2.89)  is 

X  -  RWL+Z-R" rzll” 

where  Z  is  an  arbitrary  matrix.  This  solution  exists  provided 
at  least  one  of  the  following  statements  holds,  but  not  both 
i  and  ii  at  the  same  time: 


i)  U.U*D  -  0  or  BW£Wk  -  0 

ii)  YjYJA  -  0  or  CVJVj  -  0 


(2.90) 


where  the  decomposition  of  the  matrices  A,  B,  C,  and  D  are  as 
follows : 


(m,n) 


E  a.U. 
i-1  1  1 


(p.q) 


j 


Vjvj 


(p»q) 


g 

k-1  K  k 
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and 


D, 


E  «1Y1 
1-1 


(m,n) 

Proof :  If  Eq  (2.89)  is  a  true  statement,  then 

g 


i-1 


1  l 


E  a  .U .  I  Xl  E  y.G 


k-1 


k  k 


1-1 


'1*1 


E  6.  Y,  X  I  E  6 -V 


j-1 


J  J 


Multiplying  on  the  left  by  ILU*  and  from  the  right  by  GkG£ 
yields 


UiXGk 


(IKl* 


h 

E 

i-1 


W  x  ^  ViW  "  uiui^GkGk 


which  can  be  simplified  to 

U.XG.  ♦  U.U.DXBG*G.  -  U.U*QG*G, 
lk  li  |(  k  li^kk 

But  by  hypothesis  (i)  either  U^U*D  -  0  or/and  BGj^Gj,  -  0, 
then  the  last  equation  reduces  to 


U.XGk  -  U.U*QG£Gk 


Then  this  is  a  form  of  Theorem  2.11  and  the  proof  of  the 
theorem  is  complete. 

Note  that  if  the  multiplication  had  been  by  YJYj  and 
VjV^  respectively,  then  the  results  would  have  been 

VjYixvj  "  YiYiQVjvj  (2,91) 

and  either  Y^Y*A  -  0  or  CV*Vj  “  0  would  have  to  be  true  for 
the  theorem  to  hold.  Thus  there  exists  four  distinct  ways 
for  this  theorem  to  hold. 

Example  2.6:  Consider  the  equation, 
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(2.92) 


c :  M  •]•[.: :  im*  k  .;i 

r1 

which  corresponds  to  Eq  (2.89).  Then  A*  -j^l  lj 
Eq  (2.88)  A'  -  (1/6)  j^l  lj 
Decomposing  A  yields 

A  -  a.U.  -  (/ff)(l//5)[J  l  \] 


and  from 


(2.93) 


which  implies 


and 


and  Ui  -  (!/✓&)[}  \  J] 


UU*  -  (1/2)  Q  J] 

Following  the  same  procedure  for  the  matrix  D  implies 


(2.94) 


■1 

D*  - 

0  ] 

n  -  r1 

and  D  -  (1/4) 

r 

H 

O 

.1 

0  ] 

lJ  ll 

0  1J 

Decomposing  D  yields 

D  < 

which  implies 


6kYk  "  (2D  d/2) 


H 


(2.95) 


6.  -  2  and  Y,  -  (1/2) 


thus 


Y*Y  -  (1/2)  [J  }] 


(2.96) 


Substituting  the  decompositions  of  A  and  D  from  Eqs  (2.93) 
and  (2.95)  respectively  into  Eq  (2.92)  yields 
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ri  1  n  ri  n  r  1  o  -it  ri  ii  ri  3*1 

(/S)(l//S)  X  (2)  (1/2)  0  0+  X  1  1  - 

Ll  l  iJ  li  lJ  l-i  o  lJ  Li  lJ  I2  -lJ 


Multiplying  on  the  left  by  UU*  from  Eq  (2.94)  and  on  the 
right  by  Y*Y  from  Eq  (2.96)  yields 

‘jt: :  ;i>  iic  :i 

•h:  :it:  :m  ac  ;i 

-  (1/2) (1/2)  [J  i)  [|  (1  }]  (2.97) 

In  simplifying  this  last  equation  (2.97),  the  second  term  on 
the  left  goes  to  zero,  thus  allowing  the  use  of  Theorem  2.12. 
The  simplified  form  of  Eq  (2.97)  is  then 


i: ;  :h;  a 


+  o 


(5/4) 


a 


(2.98) 


Utilizing  Theorem  2.12  the  solution  is 


~c :  :rc  an  ir 
e  :  in:  i  ;h;  i 


♦  z 


In  Eq  (2.99),  letting  Z  ■  0  implies 


1 

0 

1J 


(2.99) 


(5/24) 


[!  s  ;i 

Li  o  iJ 


and  if  Z  ■ 


f 3  4  -1], 

L-i  -3  iJ 


then  the  X  is  Eq  (2.99)  becomes 
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r  17  48  171 

X  -  (1/24)  65  96  -55 

U31  -79  17  J 

Solution  of  the  Matrix  Equation 

~~'AXr~'CYD  »  F  - - 

In  this  section  equations  of  the  type 

AXB  +  CYD  -  E 

will  be  considered.  The  theorems  to  follow  will  be  proved 
for  cases  in  which  the  variable  matrices  are  the  same.  After 
the  proof  is  complete,  those  changes  that  are  necessary  for  a 
proof  of  the  two  variable  case  will  be  given. 

Returning  to  the  work  of  Roth  (Ref  38) ,  it  should  be 
noted  that  his  results  are  all  stated  for  cases  in  which  the 
variable  matrices  are  square.  In  the  next  four  theorems  these 
results  will  be  extended  to  the  more  general  case  in  which  the 
variable  matrices  are  rectangular. 

Theorem  2.13:  A  necessary  and  sufficient  condition 
that  the  matrix  equation 

AXIj  -  I2XB  «  C  (2.100) 

where  1^  and  I2  are  identity  matrices  and  A  and  I2  are  square 
matrices  of  dimension  m  and  B  and  1^  are  square  matrices  of 
dimension  n,  all  with  elements  in  the  field  F,  is  that  the 
matrices 

[o  b]  “"d  (o  b]  <2-101> 

be  similar. 

Proof:  To  show  similarity,  there  must  exist  matrices  P 
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’B  SK1  -B  i) 


(2.102) 


Let  P  ■ 


pOn.m)  IX(m,n)  and  p-l  B  ^m.m)  'XI(m,n) 
L°(n,m)  1  (n,n).  L°(n,m)  ^n.n). 


Substituting  the  choice  for  P  and  P*1  into  Eq  (2.102)  yields 

f1  (n>»m)  IX(m,n)l  fA(m,m)  C(m,n)l  f^m.m)  'XI(m,n)l  fA  °1 

°(n,m)  1 (n,n) J L0(n,m)  B(n,n)  ■*  ^(n.m)  ^n.n)-*  ^  B** 


IAI  - IAXI+ICI+IXBI  A  0 

m 

0  IBI  OB 


(2.103) 


But  from  Eq  (2.100)  -IAXI+ICI+IXBI  is  equal  to  0.  Therefore, 
Eq  (2.103)  can  be  rewritten  as 


s  k  a 


‘A  -AI  C‘ 

r  A- AI  0* 

and 

-0  B- AI  • 

10  B-AIJ 

To  show  sufficiency,  reliance  is  made  on  the  fact  that 
since  the  matrices  in  Eq  (2.101)  are  similar,  then  the  follow¬ 
ing  pair  of  matrices 


(2.104) 


whose  elements  are  in  F[x]  will  also  be  similar.  From  Roth's 
Lemma  (Ref  38:392)  there  exists  matrices  X  and  Y  such  that 

X  -  XQ  ♦  XXX  ♦  A2X2  ♦  ...  ♦  AnXn 

Y  ■  YQ  ♦  AYj  ♦  A2Y2  ♦  . . .  ♦  AnXn 


(A  -  AI?)  X  -  Y  (B  -  AI.)  -  C 


(2.105) 


Substituting  the  values  for  X  and  Y  into  Eq  (2.105)  yields 


(A-AI2) (X0+AX1+A2X2+. . .+AnXn) Ix 

-I2  (Y0+AY1+A2Y2+...+AnYn)(B-AI1)  =  C  (2.106) 


After  multiplying  out  Eq  (2.106)  and  then  equating  like  powers 
of  A,  the  following  set  of  A+2  equations  is  generated. 


AXqIi 

-w 

=  c 

AX1I1  ♦  I2XqI1 

" I2YlB  +  12Y0ll 

*  0 

AVi  +  'ihh 

-i2Y2B  *  IjYjIj 

=  0 

(2.107) 

^Vl  +  Vn-l1!  'W  +  1  Vl1!  =  0 


!2Vl 


+  Wl 


All  of  the  1^  and  I2  in  Eq  (2.107)  are  square  and  thus  can  be 
dropped  from  the  equation  without  loss  of  generality.  Now 
multiply  each  row  of  Eq  (2.107)  by  I,  B,  B2,  B3,  ...,  Bn+1 
respectively  and  sum  the  members  of  the  resulting  equations. 
After  factoring  the  result  is 


A(Xq+X1B+X2B2+. . .+XnBn) 


(XQ+X1B+X2B2+. . ,+XnBn)B  =  C  (2.108) 


Multiplying  from  the  right  and  left  by  I2  and  1^  respectively 
yields 

A(Xq+X1B+X2B2+. ..+XnBn)I1-I2(X0+X1B+X2B2+. . .+XnBn)B  -  C 
which  therefore  implies  that  a  solution  of  Eq  (2.100)  exists 


and  is 


X  -  Xn+X.B  ♦  X-B2  +  X,B3  +  ...  ♦  X  Bn 
0  12  3  n 

By  an  entirely  similar  argument  it  can  be  shown  that 
A(Yq+Y1B+Y2B2+. . .+YnBn)I1-I2(Y0>Y1B+Y2B2+. . .+YnBn)B  *  C 
also  implies  that  a  solution  of  Eq  (2.100)  exists  and  is 

x  •  Yo  *  Y1B  ♦  Y2b2  *  V3  *  •••  *  V" 

If  Eq  (2.100)  had  been 

AXIX  -  I2YB  -  C 

then  the  proof  of  the  theorem  would  have  been  identical  except 
that  the  matrices  in  Eq  (2.101)  would  be  equivalent,  with 
matrices  P  and  Q  being  defined  as 


1  Y1  r1 

-xr 

and  Q  - 

LO  I  lo 

I. 

The  next  theorem  extends  the  last  result  by  changing 
one  of  the  identity  matrices  into  a  matrix  that  is  not  an 
identity. 

Theorem  2.14:  A  necessary  and  sufficient  condition 
that  the  equation 

AX  -  DXB  »  C  (2.109) 

where  A  and  D  are  m  x  m  matrices  and  B  is  an  n  x  n  matrix  with 
elements  in  F,  have  a  solution  X  with  elements  in  F  is  that 
the  pair  of  matrices 
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r  cma  i 

lo  bJ  Lo  bJ 


(2.110) 


be  equivalent. 


Proof.  Since  the  matrices  of  Eq  (2.110)  are  to  be 
equivalent,  then  there  must  exist  matrices  P  and  Q  such  that 


Therefore  let 


-C3--C3 
■  i:  “l  -  -c;] 


(2.111) 


which  transforms  Eq  (2.111)  into 


ri  -DX-irA  ciri  xi  i-a  o- 

Lo  i J  Lo  bJlo  iJ  Lo  b. 


(2.112) 


Upon  simplification,  this  becomes 


IAI  IAX+ICI -DXBI 


]■  c  :i 


(2.113) 


Since  the  identity  matrices  in  Eq  (2.113)  are  all  square,  no 
changes  occur  during  multiplication,  thus,  for  example,  IAI-A 
Also  making  use  of  Eq  (2.109),  it  should  be  noted  that 
AX  ♦  C  -  DXB  *  0.  Hence  Eq  (2.113)  becomes 

[A  °HA  °] 

lo  bJ  lo  bJ 

Since  the  necessary  condition  has  been  established,  the  matrix 
pair 
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A-AD  Cl  fA-AD  01 

and  (2.114) 

0  B-AIJ  L0  B-AIJ 

will  also  be  equivalent  with  elements  in  F[x]. 

Following  the  same  procedure  as  in  Theorem  2.14, 

Roth's  Lemma  (Ref  38:392)  states  that  there  exists  matrices 
X  and  Y  such  that 

X  .  X0.XXl*X2X2....-A"Xn 

and 

Y  -  YQ*AY1*A2Y2*...AnYn  (2.115) 

Thus  Eq  (2.109)  becomes 

(A-AD) (X0+AX1+A2X2*. . .♦AnXn) 

-  D(Y0+AY1*A2Y2^. . .AnYn) (B-AI)  -  C  (2.116) 

Upon  expansion  and  equating  terms  with  like  powers  of  A  the 

following  system  of  A  ♦  2  equations  is  formed. 


AX0 

-dy0b 

AX1  - 

a 

X 

o 

-  DY1B 

*  V 

> 

X 

to 

i 

DXX 

-  dy2b 

*  V 

-  C 

-  0 
-  0 


AXn  •  DXn  1 

n  n- 1 


DY  B  ♦  Y  .1-0 
n  n- 1 


DX 


n 


♦  V 


(2.117) 


2 

Upon  multiplying  the  system  in  Eq  (2.117)  by  I,  B,  B  . 

respectively,  then  adding  columnwise  and  factoring  out 
like  terms  yields 


A(XQ+X1B+X2B2+.  .  .♦XnBn) -D(X0-*-X1B-*-X2B2+.  .  .♦XnBn)B  -  C 
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This  then  implies  that 


Xft*X,B+X..B2+XTB3«-. .  .*X  Bn 
o  l  2  3  n 

is  a  solution  of  Eq  (2.109),  which  completes  the  proof. 

By  similar  arguments  it  can  be  shown  that  necessary 
and  sufficient  conditions  exist  for  a  solution  of  the  equation 


AXE  -  XB  -  C 

In  showing  these  conditions  exist,  the  matrix  pair 


(2.118) 


‘A- XI 

C 

and 

A-X I  O' 

.0 

b-xeJ 

■0  B-XE- 

are  seen  to  be  equivalent  and  the  system  of  equations  to  be 
solved  is 


AX0E 

AXjE  -  IXqE 
AX2E  -  IXXE 


-  V 

-  Y1B  +  Y0E 
-  Y2B  -  YXE 


AXnE  -  IX_  ,E  -  Y_B  ♦  Y_  ,E 


n-1 


-  IX  E 
n 


n 


.  i 

n- 1 
*  YnE 


»  C 
-  0 
«  0 

-  0 
-  0 


(2.119) 


The  X  ♦  2  equations  of  Eq  (2.119)  are  then  multiplied  from 
the  right  by  I,  A,  A*-,...,  An+1  respectively.  The  result  is 
that  Yq+AY1A“Y2+A3Y . . . +AnYn  is  also  a  solution  of  Eq  (2.118) 
If  Eq  (2.119)  had  been  expressed  as 

AX  -  DYB  -  C 

then  the  method  of  solution  and  the  proof  of  Theorem  2.14 
would  still  be  valid.  To  show  equivalence,  the  matrix  P 


would  have  to  be  changed  to 


p  -dyi 

lo  i . 


and  the  remainder  of  the  proof  follows  in  a  similar  fashion. 

Theorem  2.15:  A  necessary  and  sufficient  condition  that 

AXE  -  DXB  =  C  (2.120) 

where  A  and  D  are  square  matrices  of  dimension  m  and  B  and 
E  are  square  matrices  of  dimension  n,  all  with  elements  in  F, 
have  a  solution  X  with  elements  in  F  is  that  the  matrix  pair 


A  Cl  (A  01 

and  i 

0  BJ  Lo  bJ 


(2.121) 


be  equivalent  and  that  either 

i)  E  be  nonsingular  and  BE  =  EB 
or  ii)  D  be  nonsingular  and  AD  =  DA 

Proof:  To  show  equivalence  there  must  exist  matrices 
P  and  Q  such  that 


(2.122) 


Let 


rA 

C1 

rA  °i 

p 

Q 

and 

Lo 

B  J 

Lo  BJ 

fl  DX 

1 

r1  -xEi 

P’lo  , 

1  and 

Q  -  j 

Lo  i  J 

becomes 

fl  DX) TA 

cirr 

-XE)  fA  01 

-  ■ 

Lo  I  J  Lo 

B  JLo 

ii  Lo  bJ 

(2.123) 
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Simplifying  and  realizing  that  multiplication  by  an  identity 
matrix  leaves  the  original  matrix  unchanged  yields 


[•A  -  AXE  +  C  ♦  DXBT 

o 

< 

& — 

lO  B  1 

lo 

(2.124) 


Using  Eq  (2.120),  Eq  (2.124)  can  be  simplified  to 


< 

>■  — 

1 

\  O' 

Lo  E 

>J  l( 

3  B . 

To  show  sufficiency,  reliance  will  be  made  upon  what  has  just 
been  proved.  Since  the  matrices  of  Eq  (2.121)  are  equivalent, 
then  the  matrix  pair  of  Eq  (2.125)  will  also  be  equivalent. 


fA-AD  C-| 

rA-AD  E- 

V 

- "> 

*< 

i 

CQ 

O 

LO  B-AE. 

(2.125) 


Utilizing  Roth's  Lemma  (Ref  38:392),  there  exists  matrices 
X  and  Y  .vhere  X  and  Y  are  as  expressed  in  Eq  (2.115).  Making 
appropriate  substitutions  in  Eq  (2.120)  yields  the  equivalent 
equation 

(A-AD)(X0'MX1  +  A2X2  +  ..  .+XnXn)E 
-D(Y0+AY1^A2Y2>. . .♦AnYn) (B-AE)  -  C  (2.126) 

Multiplying  out  Eq  (2.126)  and  then  equating  coefficients  of 
like  powers  of  A  yields  the  system  of  equations 

AX0E  -  DYqB  -  C 

AXjE  -  DX0E  -  DYjB  ♦  DYQE  •  0 

AX2E  -  DXXE  -  DY2B  ♦  DYjE  -  0  (2.127) 
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AX  E  -  DX„  ,E  -  DY  B  ♦  DY  ,  E  ■  0 
n  n- 1  n  n- 1 

-  DX  E  +  DY  E  ■  0 

n  n 

Next  multiply  each  of  the  X  *  2  equations  of  Eq  (2.127  by 
I,  E* XB ,  (E^B)2,  (E^B)3,  ...  (E*  1B)n+1  respectively  and 
then  sum  columnwise  to  get  as  a  result 

A[X()+X1E‘1B+X2(E'1B)2^.  ..♦Xn(E*1B)n]E 
-D[X0'*,X1E1B+X^(E1B)2+.  .  .♦Xn(E*1B)n]B  -  C 

which  implies  that 

X0+X1E'1B>X2(E'1B)2+. . ,+X  (E_1B)n 

is  also  a  solution  of  AXE  -  DXB  ■  C  which  completes  the  proof. 
Similar  results  hold  when  Eq  (2.127)  is  multiplied  by 

I,  AD'1,  (AD*1)2,  (AD*1)3 . (AD_1)n  respectively  to  yield 

the  equation 

-DtYjj+YjAD'^Y-jtAD'1)2*.  .  . +Yn ( AD* 1 ) n] B 
♦A(Y0+Y1AD'1+Y2(AD'1)2+. . .♦Yn(AD'1)n]E  -  C  (2.128) 

Eq  (2.128)  can  then  be  arranged  to  be  in  the  form  of  Eq  (2.120) 
which  then  implies  that  a  second  set  of  solutions  can  be 
generated  by 

Yq+YjAD'^Y^AD'1)2*.  .  .♦Yn(AD'1)n 

The  following  example  incorporates  the  technique  used 
in  the  three  previous  theorems.  To  fit  this  example  to  any 
of  the  individual  theorems,  appropriate  matrices  would  need 
to  be  deleted. 
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Stating  Eq  (2.120)  as 


AXE  -  DYB  -  C 


only  has  minor  effects  upon  the  method  of  proof  given  in 
Theorem  2.15.  The  matrices  can  be  shown  to  be  equivalent  if 
P  is  changed  to 

I  DY 
0  I 

and  the  rest  of  the  proof  follows  as  discussed  in  the  theorem. 
Example  2.7:  Consider  the  equation  AXIj^  -  I2YD  *  E, 

where 


C  !]■ 11  ■ 

°i.  .2*C 

°1  r 

,  D  «  1 

l|  ,  and  E  •  (" 

,  -n 

1  L 

J  L-] 

L  -1J 

Thus  the  equation  of  the  form  expressed  in  Eq  (2.126)  is 

(A-AI)(X)I1  -  I2  (Y) (D-XI)  -  E  (2.129) 

After  making  appropriate  substitutions,  the  resulting  equa¬ 
tion  is 


ri-x  0 

U  -A 


A 


which  simplifies  to 


(1-A)x1  O' 

ViCl-A)  yx 

"o  -1* 

-Ax2  0 

_y2(1*x)  y2. 

;  1  -1_ 

(2.130) 


Eq  (2.130)  gives  rise  to  the  solution  of  four  equations  in 
four  unknowns.  These  equations  are: 
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( 1  ~ x) x x  -  y1(l-x) 

•  *1 

-xx2  -  y2(i-x) 

-yz 


o 

*i 

-l 

-l 


This  system  of  equations  then  quickly  reduces  to 


(2.131) 


X1  “  yl  »  yl  “  1  »  y2  “  1 

and  upon  further  simplification,  the  result  *  1  is  achieved. 
Thus  the  solution  to  this  equation  is 

*-[}]  »»<>  If  ’ll] 

The  next  theorem  generalizes  the  last  results  to  cases 
in  which  the  coefficient  matrices  are  non-square.  In  addition, 
the  variable  matrices  will  also  be  non-square.  In  this 
theorem  the  proof  will  be  in  terms  of  two  variable  matrices 
X  and  Y,  but  with  minor  changes  the  theorem  is  applicable  to 
equations  in  which  the  variables  are  the  same. 

Theorem  2.16:  Let  A(]n_n),  B(p  q).  and  C(mq) 

be  matrices  with  elements  in  the  polynomial  domain  F[x]  of  a 
field  and  let  ^  and  I2  be  identity  matrices.  A  necessary 
and  sufficient  condition  that  the  matrix  equation 

AXI x  -  I2YB  -  C  (2.132) 

have  a  solution  X,  . ,  Y,_  .  with  elements  in  F [ x 1  is  that 

(n,p)’  (n,p) 

the  matrices 

r  ci  r  °i 

Lo  bJ  lo  bJ 
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be  equivalent. 


Proof :  To  show  equivalence  there  must  exist  matrices 


P  and  Q  such  that 


P[A  c]q.[A  °] 

Lo  bJ  lo  bj 


(2.133) 


The  choices  of  P  and  Q  in  Eq  (2.133)  are 


I  I  Y 

«  (m,m)  (m,n)  (n,j 

-°(P»m)  ^P.l 


q  „  f"1  (m»m)  'X(n,p)  ^p.q,) 


(q.n) 


(q»q) 


Thus  Eq  (2.133)  becomes  upon  substitution 


ri  iytta  ciri  -xn  fa  o] 
L  I  j Lo  B j [o  I  J  lo  bJ 


(2.134) 


Performing  the  multiplications  reduces  Eq  (2.134)  to 


TA  -AXI+C+IYB 1  r  A  0] 

Lo  bJ  lo  bJ 


(2.135) 


Using  Eq  (2.127),  Eq  (2.130)  can  be  reduced  to 


A  0i .  rA  °i 

•  0  BJ  Lo  BJ 


which  proves  necessity.  Sufficiency  will  be  proved  with  the 
aid  of  the  idea  of  regular  pencils.  Thus  Eq  (2.132)  can  be 


written  as 


(A-XDXIj  -  I2Y(B-XI)  -  C 
The  matrices  X  and  Y  can  also  be  written  as 


(2.136) 


X  ■  Xn*XX.+X2X,f. ..♦XPX 
U  1  2  p 


52 


and 


Y  -  Yn-*-XY.*X2Y,f. .  .+XPY 

0  1  i  p 

Multiplying  out  Eq  (2.137)  and  then  equating  like  power  of  X 
yields  the  following  system  of  X  ♦  2  equations 


AX0I1 

AVr  l*oh 

AX2I1- 


i2yo B  -  c 
ViB  ♦  Vo1  -  0 
X2Y2B  *  Vl1  "  0 


AVr  IXn- 11 1  *  1 2YnB  *  Vn-11  “  0 


*  IXn* 


*  Vn1 


(2.138) 


Multiply  each  equation  of  the  system  in  Eq  (2.138)  by  1^ 

I(q,P)(BI)(p,p)’  1 (q »P) (BI } 2 (P »P) * ’ ‘ *  r(q,P)(BI)(p!p) 
respectively,  then  sum  the  resulting  equations  and  factor 
common  terms  to  yield: 

A[X0>X1(BI)^X2(BI)2^...+X2(BI)2]I(pp) 

-I[X0.Xl(BI).X2(BI)2.....Xn(Bnnl(BI(q  p)]  ■  CI(  , 


The  next  step  is  to  multiply  from  the  right  by  the  identity 
"'atril  ’(p,q)  t0  get 

A(X0*X1CBI).X2(BI)2.....Xn(BI)n)I(l,,Il)I(p,q) 


-I|X0.XlCBI).X2(BI)2.....XntBI),'l(BICqip))I(Pi, 


CI(m,p)‘(p,q) 


(2.139) 


Simplifying  Eq  (2.134)  the  result  obtained  is 


S3 


A[X0+X1(BI)+X2(BI)2+. . .+Xn(BI)n]I1 
-I2[X0+X1(BI)+X2(BI)2+. . .+Xn(BI)n]B  -  C 

This  implies  that 

X0+X1(BI)+X2(BI)2+. . .+Xn(BI)n 

is  a  solution  to  Eq  (2.132),  which  completes  the  proof. 

Similar  results  could  be  achieved  if  instead  of  elimin¬ 
ating  the  Y  terms  from  Eq  (2.138),  the  X  terms  were  eliminated. 

2 

This  is  accomplished  by  multiplying  by  I,  IAI ,  I(AI)  ,  ..., 
I(AI)n+1  respectively.  The  solution  generated  for  Eq  (2.132) 
is 

Y0+(IA)Y1+(IA)2Y2+. . .+(IA)nYn 
Example  2.8:  A  solution  is  desired  to 

AXIj  +  I2XD  -  E  (2.140) 

where 


■1  0  c 

n  r 

i  r1 

i — 

o 

o 

.0  0  c 

,]  *  X2  “  t 

1  i 

X  iJ 

D  -  [1  1J  ,  and  E  -  ^ 

Thus  an  equation  of  the  type  in  Eq  (2.132)  needs  to  be  solved, 
and  this  can  be  written  as 
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-  r 0  °] 

lo  1  oJ 


0 

OJ 


ryn  yn 


13 


y21  +  y22  ♦  y23 
^31  y32  y33 


['-x  A  ■[.“  **]  <2-i4i> 


Simplifying  this  becomes 


( !•  X)  x1 


12 


‘13J 


L  '  X  ^X21+Xx??+X  x2 


‘22  "  ~23J 

2.  ,  „  .,2. 


(i-x)  (yn*xy12*x  y13)  yn+xy12+x  y13 


1C!-X)  Cy21*Xy22+x2y23)  y21  +  Xy22+x2y23  J 


l:  ::i 


142) 


Eq  (2.142)  can  then  be  written  as  four  equations  in  six  unknowns 


(l-X)(x11+Xx12*X2x13)-(l-X)(y11+Xy12+X2y13)  -  0 


X(x2i+Xx22+X  x 2 3^ " ^ " x ^ ^y 2 i+xy ^ ? * x  y?o 


-1 


22  V”  (2.143) 

(yll+Xy12+x“yi3)  "  -1 

(y21+Xy22+x2y23}  "  ‘l 


The  system  of  equations  in  Eq  (2.143)  can  then  be  quickly 
solved  to  get 


12*x2>'13  ' 

1 

X11+Xx12+x2x13  "  1 

22*x2y2i 

1 

x21  +  Xx22«-X2x23  ■  1 

(2.144) 

This  implies  that  the  solution  vectors  are  X  -|1  and  Y 

■  xJ  Ly. 

where  x  and  y  are  arbitrarily  chosen. 


,[11  and  V  .m 
■  XJ  Ly  J 


Solutions  Generated  With  the 
Use  of  Tensor  Analysis 

This  last  section  attacks  the  problem  of  solutions  of 
equations  of  the  type  discussed  previously  in  terms  of  tensors. 
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In  so  doing,  the  work  of  Lancaster  (Ref  21)  will  be  generalized. 

Before  beginning,  some  basic  definitions  are  needed. 

Definition:  If  A,_  and  B,_  are  complex  matrices, 

-  (m,n)  (p,q) 

then  the  tensor  product  of  A  and  B  written  A  ©  B,  where  A  © 

B  is  a  complex  matrix,  is  defined  to  be  the  partitioned  matrix 


The  order  of  A  ©  B  is  mp  x  nq. 

Definition:  Let  X  be  a  matrix  of  order  m  x  n,  then  Xc 

is  the  column  vector  of  order  mn  x  1 ,  formed  by  writing  all 

the  elements  of  X  in  a  columnar  fashion  starting  with  x ^  and 

working  across  the  first  row,  the  second  row,  and  so  on  until 

all  x^j  have  been  exhausted. 

Lancaster  has  shown  (Ref  21:544)  that  if  A^m  , 

B,  »  I,  »  and  I-  where  BXI  -  I, XI  -  Q,  then  an 

(n’n)  1(m,m)  2(n,n)  2 

4,  4  'T* 

equivalent  expression  is  Gx  ■  q,  where  G  ■  (B  ©  Ij)  - 
(I2  ©  AT).  Note  that  BXIj  -  I2XA  ■  Q  could  just  as  easily 
have  been  expressed  without  the  use  of  the  matrices  I ^  and  I,. 
Example  2.9:  Consider  the  matrix  equation 

AX  -  XB  •  Q  (2.145) 


where 


Eq  (2.145)  could  then  be  expressed  as 


AXIj  -  I2XB 


where 


I, 


fi  0  0]  [1  0] 

0  1  0  and  I,  ■  j 

Lo  0  lJ  L  to  1 J 


Then  using  Lancaster's  results  a  solution  to  Eq  (2.145)  by 
solving 


[(A  ®  Ix)  -  (I2  ®  B)  ] x  -  q 
Substituting  yields 


(2.146) 


r 

r1 

0 

0- 

-1 

0 

°1 

- 

-1 

0 

o- 

1 

0 

oi 

X11 

r  • 

1 

2 

0 

1 

0 

1 

0 

1 

0 

1 

0 

2 

0 

0 

0 

2 

0 

X12 

2 

Lo 

0 

1. 

-0 

0 

1. 

. 

-0 

1 

3. 

.0 

1 

3  J 

X13 

3 

*i 

0 

0- 

-1 

0 

o- 

-1 

0 

o- 

-1 

0 

01 

X21 

4 

0 

0 

1 

0 

3 

0 

1 

0 

0 

0 

2 

0 

1 

0 

2 

0 

x22 

2 

J) 

0 

1. 

.0 

0 

1, 

- 

.0 

1 

3- 

.0 

1 

3  J 

fec23J 

•2 

(2.147) 


Simplification  yields 


0  0 

0  0 

•1  -1 

0  0 

0  0 

0  0 


1 

0 

0 

2 

0 


0 

1 

0 

0 

1 


0  -1 


X11 

~1 

x12 

2 

X13 

- 

3 

X21 

4 

X22 

2 

X23_ 

-2 

(2.148) 


Multiplying  out  Eq  (2.148)  to  form  the  system  of  six  equations 


L*X 

X 

,*x 


21  " 

1 

2x21  "  4 

22  " 

2 

X22  “  2 

(2.149) 

23  * 

3 

"X22  “  '2 
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Solving  the  system  expressed  in  Eq  (2.149),  the  values  are 


-  -1,  xn  -  2,  x 2 2  -  2,  and  -x^-Xj^j 


3 .  Thus  our 


choices  for  x^2>  x^j,  and  can  be  arbitrary. 

From  this  example  another  set  of  solutions  could  have 
been  generated  through  the  use  of  generalized  inverses. 

The  last  theorem  will  take  the  results  of  Lancaster  and 
extend  them  to  the  case  where  the  coefficient  matrices  are 
non- square. 

Theorem  2.17:  Let  the  matrices  A,  B,  C,  D,  Q,  and  X 
be  complex  matrices  which  have  the  following  dimensions: 

A(m,n)*  B(p ,q) '  C(m,n) »  D(p,q)'  Q(m,p) ’  and  X(n,q)‘  If 
T 

nq  ■  mp  and  A  A  j  0,  then  the  matrix  equation 


AXB  ♦  CXD 


(2  150) 


is  equivalent  to 


where 


G  ■  ATA  ©  BBT  ♦  ATC  ©  BDT 


(2.151) 


(2.152) 


Proof:  This  proof  utilizes  the  results  of  Lancaster 
(Ref  21:544)  for  square  matrices.  Thus  the  first  operation 
is  to  convert  Eq  (2.150)  to  one  containing  square  coefficients. 
Eq  (2.150)  then  becomes 


AT(AXB)BT*AT(CXD)BT  -  ATQBT 


(ATA) X(BBT) ♦ (ATC) X(DBT)  -  atqbt 


(2.153) 


Applying  Lancaster,  there  exists  a  matrix  G  defined  as 


S8 


(2.154) 


G  ■  (ATA)  ®  (BBT)T  ♦  (ATC)  ®  (DBT)T 
-  ATA  ©  BBT  ♦  ATC  ®  BDT 


Eq  (2.154)  is  the  desired  result. 

Example  2.10:  Consider  the  equation 


AXIj  ♦  I2XB  -  Q 


(2.155) 


where 


A  '  [l  •  *1  ■  C  I  0°  l)  •  ’  t1  "] 

r1  °  i  °i  r  , 

B  "  [2  J  x  J  »  and  Q  -  [-4  8  -2  4j 


To  find  G,  the  following  are  needed: 


T 

A*A 


c  ;i  •  '.'i  ■[::]■ 

■  c  :i 


ati. 


■2  0* 

.1  0. 


and  I. 


Thus  G  is 


G  *  ATA  x  IjlJ  ♦  ATI2  x  IjB1 


(2.156) 


Since  the  equation  was  changed,  the  value  for  Q  is  also 
changed,  thus 


T  T 
A^B 


Solving  for  the  G  of  Eq  (2.156)  and  thus  Eq  (2.151): 


59 


•E  :i'E  :I 

2 

i  a  •[:  i 

4  4  4  0 

0  4  0  4 

;E  3  ‘l  3 

♦ 

1 

.E  3  *E  3 

8 

2  2  2  0 

0  2  0  2 

Eq  (2.151)  can  then  be  expressed  as 


4  4  4  0 

X11 

-12 

0  4  0  4 

x12 

20 

2  2  2  2 

X21 

s 

-6 

1 - 

o 

o 

K> 

1 _ 

x22_ 

_  10_ 

(2.157) 


which  then  implies 


X11  +  x12  +  X21  =  *3 


X12  *  X22 


(2.158) 


The  two  equations  in  Eq  (2.158)  then  imply  that  any  number  of 
arbitrary  solutions  can  be  found  to  satisfy  Eq  (2.155). 
Generalized  inverses  could  also  have  been  used  to  solve  this 
problem. 


60 


III.  Generalized  Inverses  and 


Linear  Models 

Solutions  of  the  equations  being  discussed  can  also 
be  arrived  at  through  the  use  of  generalized  inverses. 
Theorem  2.11  is  a  case  in  point.  By  using  this  theorem,  a 
whole  family  of  solutions  can  be  generated  by  use  of  the 
general  solution 


X  =  A'CB'  +  Z  -  AA* ZBB"  (3.1) 

where  Z  is  an  arbitrary  matrix.  The  first  portion  of  this 
chapter  will  deal  with  more  general  solutions  than  the  one 
expressed  in  Eq  (3.1).  These  results  will  extend  the  work 
of  Rao  and  Mitra  (Ref  35) . 

One  advantage  that  is  gained  by  solving  matrix  equa¬ 
tions  through  the  use  of  generalized  inverses  is  that  a 
complex  system  of  equations  may  then  be  broken  down  and  re¬ 
duced  to  a  set  of  more  elementary  equations.  After  solving 
this  elementary  set  of  equations,  the  solution  gotten  are 
then  joined  to  find  a  common  solution  which  is  the  solution 
of  the  original  system. 

The  second  half  of  this  chapter  deals  with  applications 
of  the  methods  developed  to  the  study  of  linear  models.  The 
development  begins  with  a  linear  model  of  the  form 

Y  *  XB  +  U£  (3.2) 


61 


where  8  is  an  unknown  parameter  and  £  is  a  hypothetical  ran¬ 
dom  variable  with  a  given  dispersion  structure  but  containing 
unknown  variance  and  covariance  components.  These  two  com¬ 
ponents  will  then  be  estimated  by  use  of  MINQUE  (Minimum  Norm 
Quadratic  Unbiased  Estimation).  After  the  initial  development, 
more  complex  models  will  be  studied.  This  second  half  extends 
Rao  (Ref  34). 

Generalised  Inverses 

In  the  evaluation  of  a  matrix  equation  many  solutions 
are  possible.  The  problem  that  arises  is  that  some  of  these 
solutions  may  be  overlooked.  To  solve  the  equation 

AXR  -  C  (3.3) 

which  is  the  conclusion  of  Theorem  2.11,  use  is  made  of  the 
solution  in  Eq  (3.1).  However,  even  this  general  form  of 
solution  may  omit  a  vast  quantity  of  solutions.  This  omission 
is  corrected  in  the  first  theorem. 

Theorem  3.1:  A  necessary  and  sufficient  condition  for 
Eq  (3.3)  to  have  a  solution  is  that 

AA'CB'B  -  C  (3.4) 

in  which  case  the  general  solution  is 

X  -  A ' CB " ♦ Z -  A ' AZBB " ♦ ( I  -  A  *  A) V ( I  - MM" ) ♦ ( I  - N  *  N) W ( I  - BB  * )  (3.5) 

whc**c  M,  N,  V,  W,  and  Z  are  arbitrary  matrices  of  appropriate 
order  and  1  is  the  identity  matrix. 

Proof :  Let  X  be  an  arbitrary  matrix  such  that  Eq  (3.3) 

is  satisfied.  Then  if 
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aa'cb'b  -  m'cb'b 


from  Eq  (3.4) 

AA*(AXB)B'B  -  aa'cb'b 

(AA'A)X(BB'B)  -  AA'CB'B 

.  .  (3.6) 

AXB  -  AA  CB  B 

C  -  aa'cb'b 


Sufficiency  follows  by  letting  X  be  as  expressed  in  Eq  (3.5). 
Multiplying  from  the  left  by  A  and  from  the  right  by  B  yields 


AXB  -  AA'CB*B+AZB-AA'AZBB'B+A(I-A*A)V(I-MM')B 

♦  A(I-N'N)W(I-BB')B  (3.7) 

But 

AA'AZBB'B  -  AZB  (3.8) 

and 

A(I-A'A)V(I-MM*)B  -  (AI-AA'A)V(I-MM*)B 

-  (A- A) V ( I -MM' ) B  (3.9) 

-  0 

which  implies  similarly  that 

A(I-N*N)W(I-BB')B  -  A(I-N'N)W(IB-BB'B) 

-  A(I-N'N)W(B-B)  (3.10) 

-  0 

Substituting  Eq  (3.8),  (3.9),  and  (3.10)  into  Eq  (3.7)  yields 

AXB  -  AA'CB'B  -  C  (3.11) 

which  implies  that  Eq  (3.5)  is  a  solution  of  Eq  (3.3). 

In  the  sciences  it  often  occurs  that  a  system  of  matrix 


equations  must  be  solved  in  such  a  way  as  to  yield  a  common 
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solution.  Thus,  if  a  common  solution  to  the  system  of 
equations 

AX  -  C 

XB  -  D  (3.12) 

is  needed,  one  can  be  found  through  the  applications  of  the 
next  theorem. 

Theorem  3.2  (Rao  and  Mitra  (Ref  35:25)):  Let  A^m  , 
C(m,p) *  B(p ,q) ’  and  D(n,q)  be  given  matrices’  A  necessary 
and  sufficient  condition  that  the  consistent  system  of  equa¬ 
tions  expressed  in  Eq  (3.12)  have  a  common  solution  is  that 

AD  -  CB  (3.12a) 

in  which  case  the  general  expression  for  a  common  solution  is 

X  -  A‘ODB'-A'ADB'-*-(I-A'A)Z(I-BB')  (3.13) 

where  Z  is  an  arbitrary  matrix. 

Proof :  Let  X  be  a  common  solution  to  the  system  of 

equations  in  Eq  (3.12)  then 

AX  -  C  and  XB  -  D 
which  by  appropriate  multiplications  becomes 

AXB  -  CB  and  AXB  -  AD  (3.14) 

Thus  by  setting  the  two  equations  of  Eq  (3.14)  equal  to  each 
other  yields  the  result  wanted  in  Eq  (3.12a). 

Sufficiency  follows  by  letting  X  be  expressed  as  in 
Eq  (3.13).  Thus 

AX  -  C 

A[A'C+DB‘-A‘ADB**(I-A'A)Z(I-BB*)]  -  C  (3.15) 
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AA*C+ADB'-AA'ADB'+A(I-A*A)Z(I-BB')  ■  c 

AA‘C+ADB'-ADB"+(AI-AA'A)Z(I-BB‘)  «  c 

AA‘0(A-A)Z(I-BB‘)  -  C  C3.15) 

aa‘c  -  c 

The  hypothesis  of  the  theorem  states  that  the  equations  are 
consistent,  thus  AA*C  ■  C  and  X  is  a  solution  of  the  first 
equation  of  the  system.  Similarly, 

XB  -  D 

[A'C+DB"-A'ADB'+(I-A'A)Z(I-BB')]B  -  D  (3.16) 

A'CB+DB*B-A'ADB'B+(I-A‘A)Z(I-BB')B  -  D 

Making  use  of  Eq  (3.12)  this  last  equation  can  be  written  as 

A’CB+DB'B-A'CBB'B+(I-A'A)Z(IB-BB'B)  -  D 
A'CB+DB'B-A‘cB+(I-A'A)Z(B-B)  =  D 

DB'B  -  D 

Again,  since  the  equations  are  consistent,  DB"B  ■  D  which 
implies  X  is  a  solution  of  the  second  equation  of  the  system. 
Thus  X  is  the  sought-after  common  solution. 

One  application  of  Theorem  3.2  is  that  it  can  be  used 
to  solve  equations  of  the  type 

AX  +  XB  -  E  (3.17) 

To  make  use  of  this  theorem,  the  matrix  E  must  first  be 
expressed  as  a  sum  of  two  other  matrices.  Thus,  if 

E  -  C  ♦  D  (3.18) 

the  matrix  equation  in  Eq  (3.17)  can  be  written  as  the  system 
in  Eq  (3.12)  and  Theorem  3.2  can  be  applied.  This  procedure 


can  be  generalized  to  solutions  of  equations  of  the  type 


AXB  +  DXE 


(3.19) 


where  G  *  C  ♦  F.  The  method  is  as  in  the  next  theorem. 

Theorem  3.3:  Let  A,_  ,  B,_  ^  ,  C,_  ,  D,_  , 

-  (P»n)  (n,q)  (p,q)’  (s,m)’ 

E,  «... ,  and  F,  «...  be  given  matrices.  A  necessary  and  suffi- 
(n,t)  (s,t) 

cient  condition  for  the  system  of  equations 


AXB  -  C 
DXE  -  F 


(3.20) 


to  have  a  common  solution  is  that 

i)  C  -  AA'CB'B 

ii)  F  -  DD'FE'E  (3.21) 

iii)  A'A(D'FE‘)BB*  -  D'D(A‘CB') EE' 

in  which  case  the  general  expression  for  a  common  solution  is 

X  -  A'CB'+D‘FE'-A‘A(D*FE')BB' 

+  (I-A~A)V(I-EE~)+([-D"D)W(I-BB")  (3.22) 

where  V  and  W  are  arbitrary  matrices. 

Proof :  From  the  definition  of  a  generalized  inverse 
it  is  true  that  for  a  matrix  H,  HHK  «  H.  Thus,  if 

AXB  -  C 

it  follows  that 

(AA'A)X(BB'B)  -  C 

AA‘(AXB)B‘B  -  C  (3.23) 

AA'CB’B  -  C 

Thus  i  of  Eq  (3.21)  is  shown.  A  similar  argument  is  used  for  ii. 
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DXE  -  F 


(DD'D)X(EE'E)  -  F 
DD*(DXE)E'E  -  F 
DD'FE'E  «  F 


(3.24) 


What  has  been  shown  thus  far  is  that  for  i  and  ii  of  Eq  (3.21) 
each  of  the  equations  of  the  system  in  Eq  (3.20)  has  an  indi¬ 
vidual  solution  X.  Condition  iii  follows  from  the  application 
of  Theorem  3.1.  A  solution  for  the  first  equation  of  the 
system  in  Eq  (3.20)  is  if  the  arbitrary  matrices  Z,  V  and  W 
are  the  zero  matrix, 


Xl  -  ACB*  (3.25) 

The  solution  for  the  second  equation  of  the  system  is,  if  V 
and  W  are  the  zero  matrix. 


X2  -  D  FE  +  Z-D  DZE  E 


(3.26) 


Let  X^  of  Eq  (3.25)  be  the  Z  matrix  of  Eq  (3.26).  Thus 


X2  -  D  FE  +A  CB  -D  DA  CB  E  E 


Now  reversing  the  process,  let 


Xj  -  A'CB  +U-A  AUB  B 


(3.27) 


(3.28) 


and 


X2  -  D'FE 


(3.29) 

Then  let  X2  of  Eq  (3.29)  be  the  U  matrix  of  Eq  (3.28)  to  yield 
X1  -  A'CB‘+D'FE'-A'AD'FE'B*B  (3.30) 


if  the  system  has  a  common  solution  Xj  of  Eq  (3.30)  equals  X2 
of  Eq  (3.27).  Hence,  X^  *  X2  implies 


67 


-  DA'CB‘E+DD'FE*E-DD'D(A'CE^|EE*E+D(I-A'A)V(I-EE')E 

♦D(I-D'D)W(I-BB')E 

-  DA*CB‘E+DD'FE‘E-DA'CE'E+D(I-A'A)V(IE-EE*E) 

(3.33) 

♦(DI-DD~D)W(I-BB)E 

-  DD*FE_E+D(I-A*A)V(E-E)+(D-D)W(I-BB‘)E 

-  DD'FE'E 

-  F 


Thus  X  is  also  a  solution  to  the  second  equation  of  the  system 
in  Eq  (3.20).  This  then  implies  that  X  is  a  common  solution. 

Other  conditions  can  be  developed  for  the  system  of 
equations  in  Eq  (3.20)  to  hold  simultaneously.  One  of  these 
is  formalized  in  the  following  corollary. 

Corollary  3.3.1:  A  necessary  condition  for  the  system 
of  equations  given  in  Eq  (3.20)  to  have  a  common  solution  is 
that  the  matrices  A,  B,  C,  D,  E,  and  F  be  as  defined  in 
Theorem  3.3,  i  and  ii  of  Eq  (3.21)  hold  and  that  the  matrices 
C  and  F  may  be  also  defined  as  follows: 


C  -  adfe’b 
F  -  DA'CB'E 


(3.34) 


Proof:  From  Theorem  2.11  a  solution  for  AXB  »  C 
exists  and  is 

Xj  -  A*CB  ♦Z-AAZBB*  (3.35) 

Also,  a  solution  for  DXE  »  F  exists  and  is 

X2  -  D‘FE‘*Y-D*DYEE‘  (3.36) 

In  Eqs  (3.35)  and  (3.36)  the  matrices  Z  and  Y  are  arbitrary. 
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A  CB  +D  FE  -A  AD  FE  B  B 


D'FE‘+A'CB‘-D*DA'CB'E'E  (3.31) 


This  can  be  simplified  to 

a‘ad'fe'b*b  -  D‘DA'CB‘E'E 


which  is  iii  of  Eq  (3.21). 

To  show  that  a  common  solution  for  X  exists,  let  X  be 
as  defined  in  Eq  (3.22)  and  X  will  be  a  common  solution  if  it 
satisfies  both  equation  of  the  system  in  Eq  (3.20). 


AXB  -  A[A‘CB'+D'FE‘-A‘A(D*FE')BB"+(I-A'A)V(I-EE') 

+(I-D‘D)W(I-BB')1B 

-  AA‘CB'B+AD'FE'B-AA‘A(D'FE‘)BB‘B+A(I-A*A)V(I-EE*)B 

♦A(I-D‘D)W(I-BB')B 

(3.32) 

-  AA'CB'B+AD*FE'B-A(D'FE')B+(AI-AA‘A)V(I-EE‘)B 

♦A(I-D‘D)W(IB-BB'B) 

»  AA‘CB‘B+(A-A)V(I-EE‘)B+A(I-D‘D)W(B-B) 

-  aa'cb'b 

-  c 


Thus  X  is  a  solution  for  the  first  equation  of  the  system. 
Similarly, 

DXE  -  D[A‘CB‘+D'FE'-A‘A(D‘FE‘)BB‘*(I-A‘A)V(I-EE‘) 

♦(I-D'D)W(I-BB*)]E 

■  DA‘CB'E+DD‘FE'E-DA'A(D*FE*)BB*E+D(I-A'A)V(I-EE*)E 

♦D(I-D‘D)W(I-BB')E 

Using  iii  of  Eq  (3.21),  the  last  equation  can  be  written  as 


L 
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d'fe'  *  a'cb*+d'fe'-a'ad'fe'bb' 

0  =  ACB" -A* ADFEBB* 

A'CB*  *  A'AD'FE'BB'  (3.40) 

AA'CB'B  =  AA’AD'FE'BB'B 
C  -  AD'FE'B 


In  going  from  the  fourth  to  the  fifth  line  of  Eq  (3.40),  use 
was  made  of  i  in  Eq  (3.21)  of  Theorem  3.3.  To  show  the  second 
half  of  Eq  (3.34),  let  X2  be  as  defined  in  Eqs  (3.37)  and 
(3.39).  Thus 

A'CB'  *  d'fe'+a’cb'-d’da'cb'ee’ 
o  «  d‘fe'-d'da'cb'ee‘ 

D’FE'  -  D'DA'CB'EE’  (3.41) 

DD'FE'E  -  dd'da'cb'ee'e 
F  -  DA'CB'E 


Similarly  use  was  made  of  ii  in  Eq  (3.21)  of  Theorem  3.3,  and 
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the  corollary  is  proved. 

Example  3.1:  Consider  a  system  of  equations  as  defined 
in  Eq  (3.20)  where 


■■Hi:  !]•«■[ 

"•c ; ; 


8  81 
8  8 
8  8 


(3.42) 


To  see  if  a  common  X  is  possible,  the  conditions  of  Corollary 
3.3.1  will  be  checked.  Thus, 


DA'CB'E 


x  i  ■  ir  b  o c  ir  ci— 

Noble  (Ref  30:342)  states  that  if  all  the  elements  of  an  m  x  n 
matrix  Q  are  unity,  then  A*  -  (l/mn)AT.  Therefore 


(1/12) 


111 
111 
111 
■1  1  1 J 


and  B"  -  (1/4) 


r  i 

ii  ij 


(3.44) 


Thus  Eq  (3.43)  becomes 


**  c : :  a 


n  i  n 

iii 
i  i  i 
u  1  1J 


8  siri 
8  8 
L8  8 


ic :  :i  ■  □ 


(3.45) 


Checking  the  other  condition 


C  -  AD  FE  B 


where 
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yields 


(1/8) 


n 

1 

1 

L.1 


and  E'  *  (1/2)  [l  l] 


;  i  j][i  j][;]t .][;  ;]•[ 


■8  8 

8  8 

.8  8 


From  Theorem  3.3  the  value  of  the  common  solution  X  is 


x  *  a'cb’+d'fe'-a'ad’fe'bb' 

where  the  arbitrary  matrices  are  zero. 


(3.46) 


A  CB' 


(TI)  (T} 


D  FE' 


8  8 
8  8 
18  8 


][: :] 


!  il  g  &  ij 

Li  i-lLJ 


ri  r 

l  l 
l  l 
l  i- 


n  i 

l  l 
l  l 
Li  i. 


(3.47) 


(3.48) 


a-ad-pb-b.-  .  ; ;  j][|  j  j][j  j][j  ijf  ij 


■i  n 

i  i 
i  i 
■i  i 


(3.49) 


Making  the  appropriate  substitutions  from  Eqs  (3.47),  (3.48), 
and  (3.49)  in  Eq  (3.46)  the  common  solution  is 


n 

i 

i 

i 


n 

i 

i 

Ll 


n 

i 

i 

1J 


n 

i 

i 

i 


ri 

i 

i 

.i 


(3.50) 
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Some  applications  of  Theorem  3.3  and  its  corollary  are 
to  the  areas  of  model  reduction  and  filtering  theory.  The 
next  example  illustrates  model  reduction. 

Example  3.2:  Let  AXB  *  C  be  a  model  of  some  phenomenon 
in  which  the  matrices  A,  B,  and  C  are  defined  as 


1  i  I' 

1  2  3 

111 
^  T  ?• 


,  B 


1 

7 

1 

I 

1 

.4 


1 

2 

1 

I 

1 

T 

1 

5 


,  and  C 


g  a 


(3.51) 


Then  a  possible  choice  of  solution  for  AXB  =  C  is  &Xb  where 

aXb  =  ACB '  (3.52) 


Solving  for  the  generalized  inverses  of  A  and  B  yields 


A 


126 

-99 

-120 


-180 

204 

234 


B 


32976 

-45720 


-18072  -23568 

38640  46260 


-22860 

43920_ 


(3.53) 


Thus  X,  can  be  written  as 
a  d 


aV 


> 


126  -180 
-99  204 

-120  234 


llf 32976  -18072 
1JU5720  38640 


-23568  -22860" 
46260  43920 

« 


“  (7T693T) 


688176 

-1338120 

-1452816 


-1110672 

2159640 

2344752 


-1225368 

2382660 

2586888 


-1137240 

2211300 

2400840 


(3.54) 


A  reduction  of  this  model  could  be  described  by  the  equation 
DXE  *  F  where  the  matrices  D  and  E  are  defined  as 
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D  -  [1  1/2  1/3]  ,  E 

The  matrix  F  is  determined  by  the  equation 


r  1  1 
1/2 
1/3 
Ll/4 


(3.55) 


F  «  DA'CB'E 

Thus,  making  the  appropriate  substitutions 

It 


(3.56) 


(A) 


126  -180 
-90  204 

-120  234 


1  1  ^TTTST)^ 


r  32976  -45720 
-18072  38640 

-23568  46260 

1-22860  43920  J 


-,T  rp 


*  (7T55TT^378468*51 

-  [1] 

The  reduced  equation  DXE  *  F  can  then  be  written  as 


1 

7 

l 

I 

\J 


(3.57) 


[1  1/2  1/3]  X 


1  1 
1/2 
1/3 
L1/4J 


[1] 


(3.57) 


To  solve  Eq  (3.57)  the  generalized  inverses  of  D  and  E  are 
needed. 

°  *  [12]  3nd  E  *  t144  72  48  36]  (3.58) 

Solving  Eq  (3.57)  for  the  matrix  X  yields 


dXe  "  (^[j8 

tl] 

t144  72 

48 

36] 

(3.59) 

1  ' 

~5148 

2592 

1728 

1296' 

2592 

1296 

864 

648 

1728 

864 

576 

432 
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Eq  (3.59)  is  a  solution  for  the  reduced  equation.  If  there 
existed  interest  in  the  common  solution,  it  could  be  found  by 
application  of  Eq  (3.22).  For  purposes  of  this  example,  let 
the  arbitrary  matrices  V  and  W  be  the  zero  matrix.  Thu~ .  the 
common  solution  is 


X  =  A'CB'+D'FE'-A'A(D'FE")BB'  (3.60) 


This  could  also  be  expressed  as 

X  a  aXb  +  dXe  '  A'AdXeBB'  (3.61) 


The  values  of  aXb  and  dXe  are  known  for  Eqs  (3.54)  and  (3.59). 
What  needs  to  be  found  is  the  value  of 


A  A.X  BB 
d  e 


(3.62) 


1 

72  6 

-6 

A'A  = 

(73) 

6  37 

36 

-6  36 

37 

and 

■10116  1248 

-438 

-9001 

BB’ 

-  f  1  •> 

1248  3844 

3636 

3210 

(3.63) 

'■n>369J 

-438  3636 

3709 

3360 

L  -900  3210 

3360 

6429  J 

Thus  the  value 

of  Eq  (3.62)  is  the  product 

of  the 

matrices  in 

Eq  (3.63)  and 

Eq  (3. 59). 

The 

result 

then 

is 

[2592 

1296 

864 

6481 

A'A.X  BB' 
d  c 

=  (TITOT?) 

1296 

643 

432 

324  « 

dXe  (3-64> 

L  864 

432 

288 

216  J 

This  then  implies  that  the  common  solution  expressed  in  Eq 
(3.61)  is 
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X 


X.  +  .X.  -  .x„ 
a  b  d  e  d  e 


x  “  _  X. 
a  b 


(3.65) 


and  the  value  for  aX^  is  given  in  Eq  (3.54). 

Solutions  to  system  with  more  equations  can  also  be 
solved  by  applications  of  the  above  methods.  These  solutions 
are  based  upon  appropriate  choices  of  the  arbitrary  matrices. 

Theorem  3.4:  Let  the  matrices  A,  B,  C,  D,  E,  F,  G,  H, 
and  K  be  of  appropriate  dimension.  A  necessary  and  sufficient 
condition  that  the  system  of  equations 

AXB  -  C 

DXE  -  F  (3.66) 

GXH  -  K 


have  a 

common  solution  is  that 

i) 

C  -  aa'cb'b 

ii) 

F  *■  DD'FE'E 

iii) 

K  -  GG'KH’H 

(3 

iv) 

C  -  AD'FE’B 

v) 

G* KH*  -  A'AG'KH'BB'  -  D  FE" 

in  which  case  the  general  expression  for  a  common  solution  is 

X  -  A'CB'+D‘FE'+G'KH'-A'A(D'FE*-*-G*KH*)BB‘  (3.68) 

Proof :  The  proof  of  i  and  ii  follow  from  Eqs  (3.23) 
(3.24).  To  show  iii,  let  GXH  *  K.  Then  from  the  definitit'i 
of  generalized  inverse  it  follows  that 
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(GG'G)X(HH*H)  -  K 

GG'(GXH)H'H  -  K  (3.69) 

GG'XH'H  -  K 

This  then  proves  iii.  Section  iv  of  Eq  (3.67)  follows  as  in 
Eq  (3.40)  of  the  Corollary  to  Theorem  3.3.  Part  v  of  Eq 
(3.67)  follows  from  an  application  of  Theorem  3.3.  A  solution 
for  DXE  -  F  and  GXH  ■  K  exists  and  is 

Xj  -  G'KH'fD'FE'  (3.70) 

where  the  arbitrary  matrices  are  taken  as  the  zero  matrix. 

From  Theorem  2.11  a  solution  for  AXB  ■  C  exists  and  is 

X2  -  A'CB'+Q-A'AQBB'  (3.71) 

Let  the  Q  of  Eq  (3.71)  be  the  value  of  X^  from  Eq  (3.70). 

Then 

X2  -  A'CB'+D‘FE'+G‘KH'-A'A(D'FE'+G*KH‘)BB'  (3.72) 

Eq  (3.72)  is  the  common  solution  of  the  system  in  Eq  (3.66) 
if  it  exists,  and  Xj  ■  X2  which  implies  that  X,  is  a  solution 
of  DX2E  -  F. 

dx2e  -  DA'CB'E+DD*FE'E+DG'KH'E-DA'A(D'FE‘+G'KH')BB'c 

-  da'cb'e+f*dg'kh'e-da'ad'fe"bb'e-da*ag‘kh*bb'e 

(3.73) 

-  DA'CB  E+F+DG*KH'E-DA'CB'0-DA'AG'KII'BB*E 

-  f+dg'kh‘e-da*ag'kh'bb'e 

But  DX2E  ■  F,  thus  Eq  (3.73)  becomes 

F  -  F*DG'KH‘E-DA*AG'KH*BB'E 

which  implies 
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(3.74) 


DG'KH'E  -  DA'AG'KH'BB'E 
G’KH‘-  A‘AG'KH*BB‘ 

Also,  since  Eq  (3.72)  is  a  common  solution  GX2H  *  K.  Thus 

K  *  G[A*CB'+D'FE-+G*KH"-A'A(D*FE'+G‘KH')BB‘]H 

-  GA'CB"H+GD‘FE‘H+GG'KH'H-GA*AD*FE'BB'H-GA'AG'KH'BB'H 

*  ga‘cb'h+gd'fe'h+k-ga'cb'h-ga*ag'kh'bb'h 

(3.75) 

0  -  gd'fe'h-ga'ag'kh'bb’h 
gd'fe'h  -  GA’AG'KH'BB’H 
d‘fe  -  a'ag'kh'bb" 

The  conclusions  from  Eqs  (3.74)  and  (3.75)  then  imply  v  of 
Eq  (3.67).  The  proof  of  sufficiency  follows  by  letting  X  as 
defined  in  Eq  (3.68)  be  a  common  solution  for  the  system  of 
equations  in  Eq  (3.66).  Thus  X  must  satisfy  each  of  these 
equations . 

AXB  -  A[A‘CB'+D‘FE'*G'KH‘-A'A(D‘FE'+G'KH')BB']B 

=  aa'cb*b+ad*fe'b+ag"kh"b-aa'ad‘fe*bb‘b-aa'ag*kh'bb*b 

-  AA*CB'B+AD‘FE'B+AG'KH*B-AD'FE'B-AG'KH'B  (3.76) 

-  AACBB 

-  C 

This  last  statement  is  from  hypothesis  i,  Eq  (3.67).  X  is, 
therefore,  a  solution  to  the  first  equation  of  the  system. 
Solving  the  second  equation  of  the  system  is  next. 

DXE  -  D‘ [A‘CB‘+D'FE'+G‘KH‘-A‘A(D'FE*+G'KH')BB']E 

-  da'cb'e+dd'fe‘e+dg'kh'e-da'ad'fe*bb‘e-da'ag'kh*bb*e 

Making  use  of  hypothesis  ii,  iv,  and  v  of  Eq  (3.67)  yields 
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DXE  «  DA‘CB'E+F+DG*KH‘E-DA'CB*E'DG‘KH*E  -  F 


Following  a  similar  line  of  reasoning,  it  can  easily  be  shown 
that 

GXH  -  G[A'CB*+D'FE*+G‘KH'-A'A(D*FE‘+G'KH')BB']H 

-  ga'cb'h+gd"fe*h+gg*kh‘h-ga*ad*fe'bb'h-ga'ag‘kh‘bb*h 

Making  use  of  hypothesis  iii,  iv  and  v  of  Eq  (3.67)  yields 

gxh  -  ga*cb*h+gd*fe*h+k-ga”cb"h-gd'fe'h 

-  K 

Thus  X  is  a  common  solution  and  the  theorem  is  proved. 

Corollary  3.4.1:  Let  the  matrices  A, B ,C ,D ,E ,F ,G ,H ,  and 
K  be  of  appropriate  dimension.  A  necessary  and  sufficient  con¬ 
dition  that  the  system  of  equations  given  in  Eq  (3.66)  have  a 
common  solution  is  that  items  i,  ii,  iii  of  Eq  (3.67)  hold  and 
that 

vi)  F  -  DG*KH‘E 

.  (3.77) 

vii)  G  KH  -  D  DA  CB  EE  -  A  CB 

are  also  true.  A  common  solution  then  exists  and  is 

X  -  A"CB"+D"FE"+G"KH"-D"D(A'CB'+G"KH')EE'  (3.78) 

Proof:  The  proof  is  identical  to  that  of  the  theorem 
with  only  variable  name  changes. 

Corollary  3.4.2:  Let  the  matrices  A,B,C,D,E,F,G,H, 
and  K  be  of  appropriate  dimension.  A  necessary  and  sufficient 
condition  that  the  system  of  equations  given  in  Eq  (3.66)  have 
a  common  solution  is  that  item  i,  ii,  iii  of  Eq  (3.67)  hold 
and  that 
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viii) 

ix) 


K  =  GA  CB  H 


(3.79) 


d‘fe'  -  g'gd’fe’hh*  =  a'cb” 

are  also  true.  A  common  solution  then  exists  and  is 

X  =  A‘CB'+D‘FE'+G'KH'-G‘G(A"CB"+D'FE")HH'  (3.80) 

Proof:  A  proof  is  identical  to  that  of  the  theorem 
with  only  variable  name  changes. 

Example  3.3:  Solve  for  the  common  solution  of  the 
system  of  equations  as  given  in  Eq  (3.66)  where  A,  B,  C,  D, 

E,  and  F  are  as  given  in  Eq  (3.42).  Let  G,  H,  and  K  be  defined 
as  follows 


i  i  i  n 
1111 
1111 
•1111 


,  H 


■  f1  1 T . 

Li  l  lJ 


and  K 


r8 

8 

8‘ 

8 

8 

8 

8 

8 

8 

L8 

8 

8-1 

(3.82) 


To  see  if  a  common  solution  is  possible,  the  conditions  of 
the  theorem  must  be  checked.  First  find  the  necessary 
generalized  inverses.  A'  and  B"  are  as  given  in  Eq  (3.45) 
and  the  other  inverses  will  be  found  by  using  the  method  of 
Noble  (Ref  30:342),  therefore 


n 

i 

i 

Ll 


1“I 

1 

1  ’ 
1-1 


(y)[l  1]  ,  G' 


(K) 


n 

i 

i 

Ll 


i 

i 

i 

i 


i 

i 

i 

ij 


(3.83) 


Condition  i. 


1  r1  X1 

and  H  *  (±)  1  1 

6  ll  lJ 


*g  ■ !  an !  |]e  in: ; ;  :i  * 

[iil 


Condition  ii. 


DD  FE  E 


P  1 

U  i 


i  n 

i  i 
i  i 
Ll  ij 


[i  i] 


[!]  ■  [I] 


Condition  iii. 


GG  KH'H 


(A) 


n 

1 

1 

u 


1 

1 

1 

1 


rs  8 
8  8  8 
8  8  8 
8  8  8 


1 

1 

1 

1 


n 

1 

1 

1J 


n 

i 

i 

Ll 


1 

1 

1 

1 


1 

1 

1 

1 


n 

i 

i 

1J 


g  lie :  3* 


Condition  iv. 


C  -  AD  FE  B 


i  r1  1 

(»}  li  i 

[l  il 

L8  8  J 


1 

1 

1 


1 

1 

1J 


n  n 

i  i 
i  i 
i  i-i 


L8]  ti  u  [\  J]  $ 


Condition  v. 
g”kh"  -  D*FE" 
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D  FE  was  evaluated  in  Eq  (3.48)  and  is  equal  to 


G  KH 


(T^ 


1 

1 

1 

Li 


l 

l 

l 

l 


‘8 

8 

8 

L8 


8 

8 

8 

8J 


1 

1 

LI 


D* 


n 

i 

i 

n 


i  i 

i  i 

i  i 

.1  u 


(3.84) 


Thus  all  the  conditions  of  the  theorem  are  met  and  the  common 
solution  is  given  by  Eq  (3.68).  Making  substitutions  from 
Eqs  (3.47),  (3.48),  and  (3.84)  yields 


‘l  r 

'i  r 

ri  r 

i  i 

i  i 

i  i 

i  i 

i  i 

i  i 

,i  i 

Li  iJ 

.i  i. 

(3T> 


1  1 
1  1 
1  1 
1  1 


3 

3 

3 

3J 


1111 

1111 

1111 


1  1 
1  1 
1  1 
1 


J\Li 


3  3  3  3 
3  3  3  3 
3  3  3  3 
3  3  3  3. 


[l1  1]  [1  1]  c}) 


G  z2] 


‘3  3' 

‘2  1' 

'i  r 

3  3 

2  2 

■ 

i  i 

3  3 

2  2 

i  i 

1.3  3j 

z  2 

i  i 

(3.85) 


Thus  a  common  solution  exists  and  is  equal  to  the  last  matrix 
of  Eq  (3.85). 


Linear  Modeling 


The  theory  developed  in  the  section  on  generalized  in¬ 
verses  will  be  used  to  help  in  the  estimation  of  some  of  the 


components  of  linear  models.  Variance  and  estimation  error 
have  been  studied  with  regard  to  Reduced  Order  Filters  by 
Asher  (Ref  1)  and  the  estimation  of  both  variance  and  co- 
variance  have  been  considered  by  Rao  (Ref  34). 

A  linear  model  is  one  that  can  be  expressed  as 

Y  »  XB  +  \J1^1  *  ...  *  Uk£k  (3.86) 

where  Y  is  an  n- vector  of  random  variables,  X  is  a  given 

n  x  m  matrix,  8  is  an  m- vector  of  unknown  parameters,  the 

U^'s  are  given  n  x  c^  matrices,  and  the  K^  is  a  c^- vector 

of  uncorrelated  random  variables  with  a  zero  mean  value  and 

2 

a  dispersion  matrix  a. I  ,  i  *  l....k  where  the  variances  arc 

1  c^ 

unknown . 

An  alternate  method  of  expressing  Eq  (3.86)  is  as 

Y  =  X8  +  U£  (3.87) 

where  U  =  (l^ ;  U£ ;  . . .  ;  Uk)  and  £  T  =  (5 J ;  k\\  . . .  ;  5k) . 

To  estimate  the  variance  components  of  the  linear 
function 

+  ...  +  Pk<?k  (3.88) 

T 

the  quadratic  function  Y  AY  of  the  random  variable  Y  in  Eq 
(3.86)  or  (3.87)  will  be  used.  To  find  this  matrix  A,  some 
criteria  will  need  to  be  developed.  First  the  translation  of 
the  8  parameter  should  be  invariant.  Thus  consider  Eq  (3.87) 
which  can  be  written  as 

Y  -  X80  -  xe  -  X80  ♦  U£  (3.89) 


83 


Let  X8  -  XBq  ■  Xy,  then  the  estimator  of  Eq  (3.88)  becomes 

(Y  -  X8q)TA(Y  -  X8q)  (3.90) 

Since  the  variance  should  be  as  close  to  zero  as  possible, 

Eq  (3.90)  should  be  set  equal  to  zero.  Expanding  yields 

YTAY  -  (X80)TAY  -  YTAX  q  -  (XBq)TAX  0  -  0  (3.91) 


If  B  is  to  remain  invariant  under  translation,  then  from 
Eq  (3.91)  AX  -  0. 

A  second  criteria  is  that  the  estimate  be  unbiased. 
Using  the  restriction  that  AX  -  0,  the  estimate  can  be  ex¬ 
pressed  as 

YTAY  -  £TUTAU£  (3.92) 


which  is  an  expression  in  terms  of  the  hypothetical  vector 

variable  S.  If  Eq  (3.92)  is  unbiased  for  Eq  (3.88),  for 
2 

all  a  ^ ,  then 


E(£TUTAUO 


k 

r 


i.lE(^U>iSi)’ 


2  T 

E  of  tr  ufAU. 

i-1  1  11 


(3.93) 


However  Eq  (3.93)  is  another  expression  for  Eq  (3.88).  Thus 
this  implies  that 

tr  uTAUi  -  Tpi  (3.94) 


for  all  i  ■  1 . k. 

The  third  criteria  is  that  of  minimum  norm.  This  says 


that  if  the  hypothetical  variable  5  were  known,  then  a  natural 
estimator  of  Eq  (3.88)  is 


(3.95) 


(p1/c1)C^1  +  ...  +  (Pk/ckKk5k  XST° 

where  D  is  an  arbitrary  diagonal  matrix.  Thus  there  presently 
exists  two  estimators,  the  estimator  of  Eq  (3.92)  and  the 
estimator  of  Eq  (3.95).  Taking  the  difference  yields 

STUTAU£  -  5TDC  *  ST(UTAU  -  D)C  (3.96) 

This  difference  can  then  be  made  small,  in  some  sense,  by 
taking  the  norm  of  the  matrix 

||UTAU  -  D ||  (3.97) 

The  norm  of  Eq  (3.97)  can  be  any  acceptable  norm  that  satis¬ 
fies  the  properties  of  a  norm  (Ref  14:198).  One  choice  of 
norm  is  the  Euclidean  norm  defined  as 

II UTAU  -  D II2  =  tr  (UTAU  -  D)(UTUA  -  D)  (3.98) 


Thus  the  problem  of  finding  the  Minimum  Norm  Quadratic  Unbiased 
Estimator  (MINQUE)  of  Eq  (3.95)  is  one  of  finding  a  matrix  A 
such  that  Eq  (3.98)  is  a  minimum  subject  to  the  conditions 


AX  =  0 

tr  AA^  ■  p^  ,  i  ■  1, . . .  k 


(3.99) 


where  *  uTu^. 

With  these  concepts  in  mind,  consider  the  model  given 
in  Eq  (3.86),  where  X  is  a  given  m  x  n  matrix  and  8  is  an 
m-vector  of  unknown  parameter.  is  a  q-vector  such  that 


E(q)  -  0  ,  EtqeT)  -  S  ,  Cov^.,^)  *  0,  itj 


(3.100) 
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The  problem  that  is  now  considered  is  one  of  estimating  the 
q(q+l)/2  components  of  the  symmetric  matrix  S  given  in  Eq 
(3.100)  or  one  of  estimating  the  linear  function  of  S  when 
the  vector  is  unknown.  The  dispersion  of  Eq  (3.86)  is 
given  by 

D(Y)  -  UxSUj  +...+  UkSu£  (3.101) 

The  problem  of  estimating  the  given  linear  function  of  the 
elements  in  S,  which  can  be  written  as 

tr  SQ,  (3.102) 

where  Q  is  an  arbitrary  symmetric  matrix,  can  be  solved  by 
T 

letting  Y  AY  be  an  unbiased  quadratic  estimate  of  Eq  (3.102) 
with  the  restriction  that  AX  =  0.  Thus 

E(YTAY)  -  tr  AD (Y)  =  tr  S(U^AU1+  ...  +  UkAUk)  (3.103) 

Comparing  Eqs  (3.102)  and  (3.103),  it  is  obvious  that 

Q  =■  U^AU1  +  ...  +  u£aUr  (3.104) 

If  the  ...,  £k  are  known,  then  a  natural  choice  for  esti¬ 
mator  of  S  is 

(l/kH^  +  ...  ♦  £k£k)  (3.10S) 

and  a  natural  choice  for  the  estimator  of  tr  SQ  is 
tr  (1/k )(^51  5k5k)Q  -  (l/k)(^Qi  ♦...+ekQk)  (3.106) 

Now  the  estimator  that  was  initially  proposed  is 

YTAY  -  (U1C1  +...♦  Uk5k)TA(U151  ♦...♦  UkCk)  (3.107) 
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Considering  the  estimators  in  Eqs  (3.106)  and  (3.107)  as 
quadratic  expressions  in  £,  then  what  remains  is  to  minimize 
the  norm  of  their  difference.  Finally,  the  problem  is  one 
of  minimizing 

II UTAU II  (3.108) 


subject  to  the  conditions 


AX  =  0 
k  T 

I  U-AU.  =  Q 
i=l  1  1 


(3.109) 


The  preceding  development  is  due  to  Rao  (Ref  34) .  The  methods 
of  the  preceding  section  are  utilized  in  the  finding  of  the 
matrix  A  in  Eqs  (3.108)  and  (3.109). 

Example  3.4:  Consider  a  linear  model  of  the  type  in 
Eq  (3.87)  where  Y  is  an  n-vector  of  random  variables,  8  is 
an  m- vector  of  unknown  parameters,  and  the  matrices  X,  U,  and 
E,  are  defined  as  follows 


r1  i  i 

n  ri 

0  110  1 

1 

0  n 

X  -  1  1  1 

10  0  10 

0 

1  0 

liii 

lJ  U 

0  110  1 

1 

0  lJ 

*[100 

0  10  0  0 

1} 

(3.110) 

The 

matrix  U  can 

be  expressed  as 

r1  0 

110  1 

1 

o  n 

U 

-  [U.  :  U-  : 

U,]  »  0  1 

0:0  1  0  : 

0 

1  0 

1  L 

3  ll  0 

1  1  10  1 

1 

0  lJ 

and 

5T 

■  [I*  :  Cj  : 

C3]  -  [(l  0 

0)T  :  (0  1  0)T 

: 

(0  0  1)T] 
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The  problem  at  hand  is  to  solve  Eq  (3.108)  subject  to  the  con¬ 
ditions  of  Eq  (3.109).  Thus  the  equations  to  be  solved  are 


and 


r 1  i  1  i“i 

A  1  1  1  1  =0  (3.111) 

Ll  1  1  lJ 


3 

E 

i=l 


T 

U  AU 


Q 


Since  Q  is  an  arbitrarily  chosen  matrix,  let  Q  be  defined  as 


Q  * 


2 

-2 

2 


The  procedures  of  the  last  section  will  be  employed  to  solve 
the  above  system  of  equations.  From  Penrose  (Ref  31)  the 
equation  AX  =  0  can  be  solved  by  use  of  the  formula 


A  =  cX'  +  W(XX"  -  I) 


(3.112) 


where  c  is  a  constant  matrix  and  W  is  arbitrary. 

Again  applying  Noble  (Ref  30:342)  to  find  the  inverse 
of  X,  the  generalized  inverse  of  X  is 


X 


1 

1 

1 

1 


r 

l 

l 

i-l 


(3.113) 


Solving  Eq  (3.112)  yields 


A 


x 


OX 


1  1 
1  1 
1  1 


i  n 

l  l 
l  i 
l  l 
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(3.114) 


Since  W  is  arbitrary,  let  W  be  the  identity.  Thus  A  is 


(3.115) 


The  next  step  is  to  solve 

3  .p 

E  ITAU  =  Q  (3.116) 

i=i 

Since  each  of  the  IT  are  the  same,  the  equation  above  could 
be  written  as 


3UTAU  =  Q 


(3.117) 


Applying  Theorem  2.11  to  solve  for  A  yields 

A  =  (1/3)UT'QUT  ♦  Z  -  UT'UTZUU' 


(3.118) 


T  t~ 

Notice  that  U  =  U  ,  thus  U  »  U  and  this  value  is 


T  fl  0  1 

U'  =  IT  =  (1/4)  040 

Ll  0  1 


(3.119) 


In  Eq  (3.118),  since  the  end  result  is  a  common  solution,  let 
the  arbitrary  matrix  Z  equal  Ax<  Solving  Eq  (3.118)  yields 
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END 


3-80 


a.(Ku},[o  j  o][;i-|1][o  ;  ;]* 
-  5°  »][|  -\  ,|][|  s  °] 


(3.120) 


In  Eq  (3.120)  the  quantity 


if2  0  2] 

(t)  0  4  0 

*  12  0  2  J 


is  the  product  of  UT*UT  -  UU'.  Eq  (3.120)  simplifies  to 


l  r-8  16-81  ,  f-2  1  11 

'  ‘to  US  II  “  r  ( 1 1  1  .\\ 

,  r-8  16  -81 

•  ^  US  1!  ?,6J 


(3.121) 


*  <*>  [1  -2  il 

L  1  1  -2  J 


Thus  a  common  solution  has  been  arrived  at.  With  the  value 
of  A  now  known,  it  is  a  simple  matter  to  minimize  the 
quantity  ||UTAU||. 
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IV.  Conclusions  and  Recommendations 


The  increasing  use  of  matrix  equations  in  the  engineer¬ 
ing  sciences  has  stimulated  a  rapidly  growing  interest  in  how 
to  best  solve  these  equations.  The  techniques  developed  in 
this  thesis  can,  depending  on  the  equation,  be  tedious  to  do 
by  hand,  but  all  can  be  coded  for  computer  application. 

Several  different  methods  of  solution  have  also  been  presented 
so  that  if  one  method  fails  to  yield  acceptable  results, 
another  way  may  be  implemented  that  will  in  turn  be  satisfac¬ 
tory. 

Additional  attention  can  be  paid  to  the  area  of  quad¬ 
ratic  matrix  equations.  Some  of  the  techniques  that  have 
been  discussed  may  prove  to  be  of  value  in  the  solution  of 
this  type  equation. 
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